PREFACE


Mixing  and  Stirring  took  place  at  the  1999  Geophysical  Fluid  Dynamics  Summer  Study 
Program.  William  R.  Young  (Scripps  Institution  of  Oceanography)  gave  the  principal  lectures, 
with Ray Pierrehumbert (the University of Chicago) adding two special lectures (only one was
scheduled; the second was given at the request of the audience). These lectures served as an
introduction  to  the  general  physical  principles  of  the  subject  and  its  many  applications  in  a  large 
number  of  fields,  and  provided  us  with  essential  mathematical  technology. 

As  usual,  we  had  a  variety  of  seminars  throughout  the  weeks  following  the  lectures,  covering  the 
many subjects in which mixing and stirring play a role. Week 4, organized by Jim Ledwell
(Woods  Hole  Oceanographic  Institution)  with  great  success,  focussed  the  attention  of  the 
participants  on  the  oceanic  problem.  Several  visitors  came  to  participate  solely  in  this  week,  and 
we  saw  many  familiar  faces  from  past  summers. 

This year was also the 40th birthday of the Program. In celebration, the Oceanographic
Institution  graciously  held  a  picnic  for  all  the  participants,  for  the  people  at  Woods  Hole 
connected to the Program, and for any fellows from previous summers who were able to visit for
an  afternoon.  George  Veronis  gave  a  memorable  speech  with  a  historical  perspective.  The 
picnic  ended  with  a  customary  sight  -  a  GFD  softball  game  -  and  the  afternoon,  thanks  to 
W.H.O.I.,  was  a  very  pleasant  experience. 

This  year's  fellows  proved  to  be  a  group  that  meshed  together  especially  well  (including  group 
triathlons  each  morning  for  the  hardier  fellows).  Their  academic  accomplishments  can  be 
viewed  elsewhere  in  this  volume.  One  notable  feature  of  their  efforts  is  that  they  are  all,  pretty 
uniformly,  commendable  projects;  sometimes,  it  has  to  be  admitted  that  the  program  doesn't 
work  for  everyone,  but  this  year,  I  think  it  worked  as  well  as  it  could.  Consequently,  I  think  this 
summer  was  notably  successful. 

Special  thanks  go  to  Bill  Young  for  his  tireless  efforts  in  preparing  and  giving  the  principal 
lectures,  and  for  advising  so  many  of  the  fellows.  This  summer  could  not  have  been  so 
successful  without  Bill.  Also,  Eric  Chassignet  and  Glenn  Flierl  spent  many  selfless  hours  with 
the  computers,  and  Jack  Whitehead  dealt  magnificently  with  many  administrative  matters 
throughout  the  whole  year  preceding  the  Program.  Jean-Luc  Thiffeault  and  Claudia  Pasquero 
must  be  thanked  for  their  important  contributions  to  creating  this  volume. 

And  last,  but  by  no  means  least,  I  thank  W.H.O.I.  Education,  who  continue  to  provide  a  perfect 
atmosphere  in  which  to  run  the  program.  I  specially  thank  Marcey  Simon  for  all  her  efforts  to 
organize  the  program,  and  our  two  staff  assistants,  Veta  Green  and  Janet  Fields,  who  stepped  in 
to  replace  the  veteran,  Lee  Campbell,  when  she  moved  off  to  other  enterprises. 


N.  J.  Balmforth 
Director


Lecture 1: Overview


1 Diffusion by discontinuous movements

In 1827 Robert Brown observed that suspended pollen grains are in an uninterrupted and irregular
“swarming” motion. Brown was a botanist and at first he believed that only organic materials
exhibited this agitation. But very soon he extended his observations to particles of inorganic material,
such as a ground-up fragment of the Sphinx. Through the nineteenth century there was an intermittent
discussion concerning the cause of this Brownian motion, and in 1877 Delsaux suggested that the
impact of molecules on a macroscopic particle produces observable displacements. In 1905, after
nearly a century of debate, Einstein definitively explained this phenomenon [6,7].


1.1  Einstein’s  derivation  of  the  diffusion  equation 

Our  interest  here  is  in  Einstein’s  derivation  of  the  diffusion  equation,  which  is  very  different  from  that 
of  Fourier.  We  consider  one-dimensional  Brownian  motion  by  projecting  the  location  of  the  particle 
onto  a  straight  line  which  we  call  the  x-axis. 

Einstein’s assumptions are the following: (i) the particles move independently of one another;
(ii) we observe particle positions at time intervals $\tau$ which are much greater than the time intervals
between molecular collisions. As a result, the motion in one interval is independent of what happened
in the previous interval.

In the interval $\tau$ each particle has a random displacement $\Delta$ along the x-axis. The probability
density function (PDF) of $\Delta$ is $\phi(\Delta)$. This means that if we observe $N \gg 1$ particles for a time $\tau$ then
the number of particles which are displaced through a distance which lies between $\Delta$ and $\Delta + d\Delta$ is

$$dN = N \phi(\Delta)\, d\Delta . \tag{1}$$


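The PDF $\phi(\Delta)$ does not change from interval to interval, and $\phi$ is symmetric and normalised:

$$\phi(\Delta) = \phi(-\Delta), \qquad \int_{-\infty}^{\infty} \phi(\Delta)\, d\Delta = 1 . \tag{2}$$

The symmetry of $\phi$ implies that the displacements are unbiased. The average of any function of $\Delta$,
$f(\Delta)$, is

$$\overline{f} \equiv \int_{-\infty}^{\infty} f(\Delta)\, \phi(\Delta)\, d\Delta . \tag{3}$$

In particular, $\overline{\Delta^2}$ is the mean square displacement in a single step.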

If the concentration of particles at time $t$ is denoted by $c(x,t)$, then the evolution of $c$ is determined
from the master equation:

$$c(x, t+\tau) = \int_{-\infty}^{\infty} c(x-\Delta, t)\, \phi(\Delta)\, d\Delta . \tag{4}$$




A  random  walk  with  200  steps 


Figure  1:  Simulated  Brownian  motion  using  MATLAB;  the  routine  rand  is  used  to  generate  a  sequence  of 
200  random  displacements. 


The integral over $\Delta$ is a sum over the prior locations at time $t$ of the particles that are at $x$ at time
$t + \tau$. Thus, the number of particles in the interval $(x - \Delta, x - \Delta + d\Delta)$ is $c(x - \Delta, t)\, d\Delta$ and $\phi(\Delta)$
is the fraction of these particles which jump from $x - \Delta$ onto $x$.

If the concentration $c(x,t)$ changes on a length scale which is much greater than the root mean
square displacement, then we can approximate the integral equation (4) by the diffusion equation.
This assumption that $c$ is slowly varying means that it is sensible to use a Taylor series expansion


$$c(x,t) + \tau c_t(x,t) \approx \int_{-\infty}^{\infty} \phi(\Delta) \left[ c(x,t) - \Delta\, c_x(x,t) + \frac{\Delta^2}{2}\, c_{xx}(x,t) \right] d\Delta . \tag{5}$$


Next, using (2), we reduce (5) to

$$c_t(x,t) \approx D\, c_{xx}(x,t), \qquad D \equiv \frac{\overline{\Delta^2}}{2\tau} . \tag{6}$$

This is the diffusion equation, and $D$ is the diffusivity.

The  greatness  of  Einstein’s  contribution  to  this  subject  is  not  the  derivation  above  but  rather  his 
formula  for  the  diffusivity  of  a  macroscopic  particle 


$$D = \frac{RT}{6\pi N_a \nu a} , \tag{7}$$

where $R$ is the gas constant, $T$ the absolute temperature, $N_a$ the Avogadro number, $\nu$ the coefficient
of  viscosity  and  a  the  radius  of  the  particle.  Coincidentally,  (7)  was  also  discovered  in  1905  by  William 
Sutherland  in  Australia.  This  relation  enabled  Perrin  to  determine  Avogadro’s  number  by  observing 
Brownian  displacements  [7]. 

The  diffusion  equation  is  an  approximation  of  the  more  exact  master  equation.  As  we  try  to  design 
parameterizations  of  nonlocal  mixing  processes,  in  which  scale  separation  assumptions  are  shaky,  we 
should  pay  more  attention  to  this  history  and  consider  the  possibility  of  using  integral  equations  such 
as (4). Notice also that if the Taylor expansion in (5) is continued to higher order then one will usually
(i.e. for most kernels $\phi$) obtain a hyperdiffusive term such as $c_{xxxx}$.
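
As a sketch of where the hyperdiffusive term comes from, one can carry both sides of the master equation (4) one order further. This aside is not part of the original derivation; it assumes that $\phi$ has a finite fourth moment $\overline{\Delta^4}$ and that $D = \overline{\Delta^2}/(2\tau)$:

$$\tau c_t + \frac{\tau^2}{2}\, c_{tt} \approx \frac{\overline{\Delta^2}}{2}\, c_{xx} + \frac{\overline{\Delta^4}}{24}\, c_{xxxx} .$$

Replacing $c_{tt}$ by its leading-order value $D^2 c_{xxxx}$ gives

$$c_t \approx D\, c_{xx} + \frac{\overline{\Delta^4} - 3 \left( \overline{\Delta^2} \right)^2}{24 \tau}\, c_{xxxx} .$$

The coefficient vanishes when $\phi$ is Gaussian, for which $\overline{\Delta^4} = 3 (\overline{\Delta^2})^2$. For other kernels it is generally nonzero and can have either sign.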


1.2  The  method  of  moments 

As  a  check  on  the  derivation  of  (6),  we  take  a  different  approach  using  the  method  of  moments.  A 
moment  of  the  concentration  is  an  integral  of  the  form 


$$\int_{-\infty}^{\infty} x^n c(x,t)\, dx . \tag{8}$$

The zeroth moment, $n = 0$ in (8), is the total number of particles:

$$N = \int_{-\infty}^{\infty} c(x,t)\, dx . \tag{9}$$


The  first  and  second  moments  can  be  interpreted  as  the  center  of  mass  and  moment  of  inertia  of  the 
concentration  profile. 

We expect that $N$ is constant, and it is educational to verify this conservation law for both the
master equation and the diffusion equation by “taking the zeroth moment”. Integrating (4) from
$x = -\infty$ to $x = +\infty$, and changing the order of the integrals on the right-hand side gives

$$N(t+\tau) = \int_{-\infty}^{\infty} d\Delta\, \phi(\Delta) \int_{-\infty}^{\infty} dx\, c(x - \Delta, t) . \tag{10}$$

Changing variables to $x' = x - \Delta$ in the inner integral, and using (2), gives the particle conservation
law $N(t+\tau) = N(t)$. The diffusive analog of particle conservation is easily obtained by integrating the
diffusion equation (6) from $x = -\infty$ to $x = +\infty$. Provided that $D c_x$ vanishes at $x = \pm\infty$ (physically,
there is no flux of particles from infinity), one immediately finds that $N_t = 0$.

Extending the procedure above to higher moments, we can make a comparison between the exact
results for $\int x^n c\, dx$ and the diffusive approximation of these same integrals. To take the first moment
of the diffusion equation, multiply (6) by $x$ and integrate from $x = -\infty$ to $x = +\infty$. Once again, we
use integration by parts and assume that terms such as $x c_x$ and $c$ vanish as $x \to \pm\infty$. Thus we find
that the center of mass is stationary

$$\frac{d}{dt} \int_{-\infty}^{\infty} x\, c(x,t)\, dx = 0 . \tag{11}$$

The same result can be obtained by taking the first moment of the master equation. The center of
mass is stationary because in (2) we assume that the PDF of displacements is symmetric.

Continuing, we come to the second moment. For the diffusion equation we obtain

$$\frac{d}{dt} \int_{-\infty}^{\infty} x^2 c\, dx = 2D \int_{-\infty}^{\infty} c\, dx = 2DN , \tag{12}$$


where, as before, the terms which fall outside the integration by parts are zero because of the rapid
decay of $c$ as $x \to \pm\infty$. The student should show that from the master equation

$$\int_{-\infty}^{\infty} x^2 c(x, t+\tau)\, dx - \int_{-\infty}^{\infty} x^2 c(x,t)\, dx = N \int_{-\infty}^{\infty} \Delta^2 \phi(\Delta)\, d\Delta . \tag{13}$$

Recalling the definition of the diffusivity in (6), we see that in the limit $\tau \to 0$ the difference equation
in (13) can be approximated by the differential equation in (12).

The law in (12), that the mean square displacement of a cloud of particles grows linearly with time,
is often taken to be the defining characteristic of diffusion. As we will see later, there are dispersive
processes which have other power-laws, such as $\int x^2 c\, dx \propto t^{1/2}$. These processes are referred to as
“anomalous diffusion”.
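
The moment results (9) to (13) can be checked numerically. The following is a minimal sketch, not taken from the lectures: it assumes the simplest possible kernel, in which each particle jumps by $\Delta = +1$ or $\Delta = -1$ with probability 1/2 in each interval $\tau = 1$, so that the mean square step is 1 and $D = 1/2$. The function names are hypothetical.

```python
def master_step(c):
    """One application of the master equation (4) for the assumed kernel:
    Delta = +1 or -1, each with probability 1/2."""
    new = [0.0] * len(c)
    for j in range(1, len(c) - 1):
        new[j] = 0.5 * (c[j - 1] + c[j + 1])
    return new


def moments(c, x):
    """Zeroth, first and second moments of the concentration profile."""
    m0 = sum(c)
    m1 = sum(xi * ci for xi, ci in zip(x, c))
    m2 = sum(xi * xi * ci for xi, ci in zip(x, c))
    return m0, m1, m2


half_width = 300                      # grid is wide enough that nothing reaches the edge
x = list(range(-half_width, half_width + 1))
c = [0.0] * len(x)
c[half_width] = 1.0                   # all particles start at x = 0, so N = 1

steps = 200
history = [moments(c, x)]
for _ in range(steps):
    c = master_step(c)
    history.append(moments(c, x))

N, first, second = history[-1]
print("N =", N, " first moment =", first, " second moment =", second)
```

With these assumptions the zeroth moment stays at 1, the first moment stays at 0, and the second moment increases by one unit per step, which is the linear growth $2DNt$ predicted by (12).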


2  Diffusion  by  continuous  movements 

2.1  Lagrangian  time  series 

In  1922  Taylor  [11]  analyzed  the  diffusing  power  of  a  velocity  field.  The  basic  concept  here  is  that  of 
a Lagrangian time series, such as the $x$-velocity of a tagged fluid particle, $u(t)$, as a function of time.
This data is Lagrangian (i.e., following a “float”), not Eulerian (i.e., obtained from a “current meter”
fixed  in  space).  The  velocity  time  series  might  look  like  figure  2.  Clearly  there  is  some  regularity: 
evenly  spaced  maxima  and  minima  are  obvious,  and  we  might  guess  that  there  is  a  wave  which  is 
producing  oscillatory  displacements.  At  the  same  time,  the  velocity  is  not  completely  predictable, 
and  there  is  no  obvious  law  by  which  we  can  anticipate  all  details  of  the  future  using  observations  of 
the  past. 

The  simplest  assumption  we  can  make  to  analyze  the  process  in  figure  2  is  that  the  velocity  is 
statistically  stationary.  This  means  that  average  properties  of  the  velocity,  such  as  the  mean  square 
velocity,  are  not  changing  with  time.  In  operational  terms,  the  assumption  of  stationarity  means 
that  if  we  take  nonoverlapping  and  well-separated  subsamples  of  the  time  series  in  figure  2  then  the 
statistical  properties  of  the  subsamples  are  identical. 

If  the  time  series  is  long  enough  we  can  chop  it  into  N  chunks,  each  of  length  T.  We  define  an 
ensemble  average  by  considering  each  of  the  N  chunks  as  a  single  realization  of  a  random  process. 
This procedure introduces the additional assumptions that there is a decorrelation time $\tau \ll T$, and
that time averages are equivalent to ensemble averages. Thinking of dispersion, Taylor imagined that
each chunk was an independent particle, labeled $n = 1, 2, \ldots, N$, executing continuous movements.
“Continuous” in this context means that the velocity of particle $n$, $u_n(t)$, is a relatively smooth
function of time, at least in comparison with the jittery motion in figure 1.

We denote the position of particle $n$ by $x_n(t)$, so that if all the particles begin at $x = 0$ then

$$\frac{dx_n}{dt} = u_n(t), \qquad \Rightarrow \qquad x_n(t) = \int_0^t u_n(t')\, dt' . \tag{14}$$

We use angular brackets $\langle\,\rangle$ to denote the ensemble average. As an example of this notation, the
average velocity of the $N$ particles is

$$\langle u \rangle \equiv \frac{1}{N} \sum_{n=1}^{N} u_n(t) . \tag{15}$$

A time series of Lagrangian velocity


Figure  2:  A  time  series  with  a  spectral  peak. 


Because of the stationarity assumption, $\langle u \rangle$ is independent of time, and we can refer all displacements
relative to the position of the center of mass by writing $x_n' = x_n - \langle u \rangle t$ and $u_n' = u_n - \langle u \rangle$. To save
decorating all our subsequent $x$’s and $u$’s with primes we now assume that $\langle u \rangle = 0$.


2.2  Taylor’s  formula 

The simplest measure of dispersion about the center of mass is the mean square displacement, $\langle x^2 \rangle$.
We can calculate the rate of change of this quantity by first noting that:

$$\frac{dx_n^2}{dt} = 2 x_n u_n, \qquad \text{and (14)} \qquad \Rightarrow \qquad \frac{dx_n^2}{dt} = 2 \int_0^t u_n(t)\, u_n(t')\, dt' . \tag{16}$$

We now ensemble average (16). Because of stationarity, $\langle u(t) u(t') \rangle$ depends only on the time difference
$t - t'$. Thus, we introduce the correlation function

$$C(t - t') \equiv \langle u(t)\, u(t') \rangle , \tag{17}$$

and, after a change of variables, write the ensemble average of (16) as

$$\frac{d \langle x^2 \rangle}{dt} = 2 \int_0^t C(t')\, dt' . \tag{18}$$


Equation (18) is Taylor’s formula, which relates the variance in particle displacement $\langle x^2 \rangle$ to an
integral of the Lagrangian velocity autocorrelation function $C(t)$.

In the simplest situations the correlation function $C(t)$ decreases rapidly to zero as $t \to \infty$ so
that the integral in (18) converges. In this case, the dispersion of the ensemble at large times is
characterized by a diffusivity $\langle x^2 \rangle \sim 2Dt$, where the diffusivity $D$ is related to the correlation function
by:

$$D = \int_0^{\infty} C(t)\, dt . \tag{19}$$


In  statistical  physics,  (19)  is  known  as  the  Green-Kubo  formula. 
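
To see how the time integral of the correlation function settles down to a constant diffusivity, the sketch below evaluates it for one assumed correlation function. The damped cosine $C(t) = \langle u^2 \rangle e^{-t/T_d} \cos \omega t$ is a hypothetical choice (it mimics a time series with a spectral peak), as are the parameter values and function names. For this choice the integral from zero to infinity is $\langle u^2 \rangle T_d / (1 + \omega^2 T_d^2)$.

```python
import math


def correlation(t, u2=1.0, t_decay=1.0, omega=2.0):
    """Assumed Lagrangian velocity autocorrelation: a damped cosine."""
    return u2 * math.exp(-t / t_decay) * math.cos(omega * t)


def growth_rate(t, n=20000):
    """d<x^2>/dt = 2 * integral_0^t C(s) ds, using the trapezoidal rule."""
    if t == 0.0:
        return 0.0
    h = t / n
    total = 0.5 * (correlation(0.0) + correlation(t))
    for i in range(1, n):
        total += correlation(i * h)
    return 2.0 * h * total


u2, t_decay, omega = 1.0, 1.0, 2.0
# Closed-form value of integral_0^infinity C(t) dt for the damped cosine.
d_exact = u2 * t_decay / (1.0 + (omega * t_decay) ** 2)

for t in (0.5, 2.0, 10.0, 40.0):
    print(t, 0.5 * growth_rate(t), d_exact)
```

At early times half the growth rate differs noticeably from the limiting value, and it approaches that value only once $t$ is several decorrelation times.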

Taylor did not claim that turbulent dispersion was governed by the diffusion equation, (6). We
will return to this point later. For the moment notice that (6) is an approximation valid only for
sufficiently long times that the integral in (18) has converged to the constant $D$. This restriction is
related to Einstein’s assumption that particle positions are observed at time intervals $\tau$ which are
much greater than the decorrelation time.


3 Diffusion and anomalous diffusion


In  the  previous  sections  we  emphasized  that  the  diffusion  equation  (6)  is  only  valid  on  times  long 
compared to the decorrelation time $\tau$, and only if the concentration $c(x,t)$ varies on length scales
greater than the width of the density $\phi(\Delta)$. These assumptions of scale separation in both time and
space are often not satisfied in real flows. Thus, dispersion experiments over the last ten years have
revealed behaviours which are much richer than those suggested by the arguments of Einstein and
Taylor. Experiments often show that the growth of variance is described by a power law

$$\langle x^2 \rangle \propto t^{\xi} . \tag{20}$$

In some cases $\xi = 1$ (diffusion), but sometimes $\xi \neq 1$, in which case the process is referred to as
anomalous diffusion.


3.1  Rayleigh-Benard  convection 

As an example of hydrodynamic diffusion ($\xi = 1$) and transient subdiffusion ($\xi = 2/3$) we mention the
experiments of Solomon and Gollub [9,8] on the dispersion of passive scalar (either methylene blue or
uranine dye, or small latex spheres) along a chain of Rayleigh-Benard convection cells (see figure 3).
We refer to the passive scalar generically as “tracer”.

Following the experimental procedure in figure 3, suppose that all of the tracer is initially released
in a single cell. The main question is: how many cells, $N(t)$, have been invaded by tracer at time
$t$? If this dispersive process is described by diffusion then we expect that $N(t) \propto t^{1/2}$. With certain
interesting restrictions, this $t^{1/2}$-law is the experimental result.

The Rayleigh-Benard flow can be approximately described using a two-dimensional and incompressible
velocity field, $(u, w)$, obtained from the streamfunction

$$\psi = k^{-1} A \sin \left[ k (x + B \sin \omega t) \right] W(z), \qquad (u, w) = (-\psi_z, \psi_x) . \tag{21}$$


The parameter $A$ controls the amplitude of the flow, $k = 2\pi/\lambda$ is the wavenumber, and $W(z)$ is a
function which satisfies the no-slip boundary conditions at $z = 0$ and $z = H$. The term $B \sin \omega t$ is a
simple model of the lateral oscillation of the roll pattern which results from an instability which occurs
when  the  convection  is  driven  sufficiently  strongly.  Because  the  flow  in  (21)  is  simple,  highly  structured 
and  deterministic,  this  is  not  an  example  of  turbulent  dispersion.  Nonetheless,  the  experimental  results 
can  be  summarized  using  the  notion  of  an  effective  diffusivity. 

The Peclet number,

$$P \equiv \frac{A}{k \kappa} , \tag{22}$$

where $\kappa$ is the molecular diffusivity of the tracer, is a nondimensional parameter which measures the
importance of advection relative to molecular diffusion. The Peclet number can be considered as the ratio
of the diffusion time across a cell, $1/(k^2 \kappa)$, to the time it takes a molecule to orbit around a convection
cell, $1/(kA)$. In the experiments described here, $P$ is large and molecules make many circuits around a
convection cell before Brownian motion jostles them through a distance as large as $k^{-1}$.

There  are  two  cases  which  must  be  carefully  distinguished: 


Steady rolls: The rolls are steady if either $\omega = 0$ or $B = 0$ in (21). In either case, tracer can pass
from one roll to a neighbour only via molecular diffusion. But, because molecules are advected
through a distance $k^{-1}$, the dye is transported along the array of cells with an effective diffusivity
$D_{\rm eff} \propto \sqrt{A \kappa / k} \gg \kappa$. Because $D_{\rm eff} \to 0$ if $\kappa \to 0$, the transport is limited by molecular diffusion.

Unsteady rolls: If $B$ and $\omega$ are both nonzero then advection (rather than molecular diffusion) can
take particles through the time-averaged position of the cell boundaries. In this case, there is
the possibility of transport unlimited by weak molecular diffusion.

Figure 3: Transport of uranine dye along an array of convection cells with $kB = 0.12$; time (from the top):
1, 2, 4 and 10 periods of oscillation. (Figure courtesy of Tom Solomon [10].)


In  the  unsteady  case,  Solomon  and  Gollub  show  that  trajectories  of  particles  computed  with  the 
model  streamfunction  (21)  are  similar  to  the  patterns  observed  experimentally.  In  both  the  numerics 
and the experiments, provided that $\omega B \neq 0$, the transport of particles along the array of cells (in the
$x$-direction) is due to chaotic advection in the neighbourhood of the roll boundaries. This process is
strikingly  shown  in  figure  3. 

A  rough  summary  of  the  results  is  that  in  both  the  steady  and  the  unsteady  cases  the  dye  spreads 
via a one-dimensional diffusive process, $\xi = 1$ in (20), with a local effective diffusivity $D_{\rm eff}$. The
number of invaded cells is $N(t) \propto t^{1/2}$. In the unsteady case $D_{\rm eff}$ is independent of the molecular
diffusivity $\kappa$, while in the steady case $D_{\rm eff} \propto \kappa^{1/2}$. The effective diffusivity in the unsteady case is
enhanced by 1 to 3 orders of magnitude over the effective diffusivity of the steady case (which in turn
is much greater than the molecular diffusivity, $\kappa$).

The  summary  in  the  previous  paragraph  omits  many  interesting  details.  One  of  the  more  important 
caveats  is  that  the  effective  diffusivity  in  the  steady  case  only  describes  the  dispersion  process  at  very 
long  times: 


$$N(t) \propto t^{1/2} \qquad \text{when} \qquad t \gg \frac{1}{k^2 \kappa} . \tag{23}$$

The time $1/(k^2 \kappa)$ is an estimate of the time taken for molecular diffusion to transport tracer through
a distance of order $k^{-1}$, from the edge of a cell to the center$^1$. In this long time limit, the evolution
of the tracer is slower than the intracellular diffusion time $1/(k^2 \kappa)$ and consequently the concentration
is uniform within each roll. The concentration changes rapidly at diffusive boundary layers (with
thickness proportional to $\kappa^{1/2}$) which are located at the roll boundaries. The intercellular flux across
these boundary layers is responsible for the spread of the tracer from one roll to the next.

$^1$ We assume that the aspect ratio of the cells is of order unity, $kH = O(1)$.

The scenario described above does not have time to become established until $t \gg 1/(k^2 \kappa)$. When
$t \ll 1/(k^2 \kappa)$ there is still a significant dispersion of tracer through many cells which is described by the
anomalous diffusion law

$$N(t) \propto t^{1/3} \qquad \text{when} \qquad t \ll \frac{1}{k^2 \kappa} . \tag{24}$$

The anomalous process above relies on molecular diffusion passing tracer quickly across the cell boundaries
before there has been time to reach the center of newly invaded cells [3,4,13]. Thus there is a
transient regime of subdiffusion which precedes the final asymptotic diffusive law in (23).

3.2  Anomalous  diffusion  in  two-dimensional  turbulence 

Cardoso  et  al.  [2]  conducted  an  experimental  study  of  dispersion  in  a  quasi-two-dimensional  turbulent 
flow. The experimental apparatus is a shallow pan of fluid, 30 cm by 30 cm, and 3 mm deep. The
pan is filled with salty water and flow is driven electromagnetically ($\mathbf{E} \times \mathbf{B}$ forcing). The forcing is
arranged so that the basic flow is a square lattice of $30 \times 30$ counter-rotating vortices. This flow is
almost  two-dimensional  because  of  the  large  disparity  between  the  horizontal  dimensions  (30  cm)  and 
the  vertical  dimension  (3  mm). 

Although  the  forcing  produces  a  regular  array  of  vortices,  this  simple  pattern  is  unstable  and  a 
two-dimensional  turbulent  flow  emerges.  Visualization  of  the  turbulence,  using  tracer  particles,  shows 
that  in  the  statistically  equilibrated  state  there  is  a  population  of  vortices  whose  size  is  two  or  three 
times  the  injection  scale  of  the  forcing.  Each  vortex  emerges,  moves,  merges  with  other  vortices,  and 
eventually  disappears. 

Cardoso  et  al.  [2]  injected  dye  into  this  vortex  mess  and  observed  the  two-dimensional  dispersion 
of  the  dye  in  the  horizontal  plane.  To  measure  the  growth  of  the  dye  blob,  they  defined 

$$R_m \equiv \int \sqrt{x^2 + y^2}\; c(x,y,t)\, dx\, dy \Big/ \int c(x,y,t)\, dx\, dy , \tag{25}$$

and

$$R_g \equiv \sqrt{ \int (x^2 + y^2)\, c(x,y,t)\, dx\, dy \Big/ \int c(x,y,t)\, dx\, dy } . \tag{26}$$

The  experimental  scaling  law  is 

$$(R_g, R_m) \sim t^{0.32 \pm 0.04} . \tag{27}$$

The exponent $0.32 \neq 1/2$ indicates anomalous diffusion, specifically subdiffusion, because the dispersion
is slower than diffusion.

By examining typical particle trajectories, such as the one in figure 4, Cardoso et al. explained the
subdiffusive growth in terms of an interrupted random walk. Consider a random walker who pauses
between steps. The length of the pause, $\tau$, is a random variable; in the experiment of Cardoso et
al. the pause is a trapping event in which a molecule is sequestered in the core of a stationary vortex.
If the average duration of a pause is well defined then one can simply use Einstein’s formula (6)
with $\tau$ replaced by the average time between steps. However, if the pausing times are very broadly
distributed then the average duration of a pause may be infinite and consequently the dispersion is
subdiffusive. We explore this in more detail in the next section.

Figure 4: The trajectory of a single particle shows a sequence of long flights interrupted by trapping events
in which the particle circles around a vortex. The vortex trapping events are indicated by the arrows. (From
Cardoso et al. [2].)

3.3  Random  walk  with  pauses 

Consider a random walk in which the walker pauses for a random time $\tau$ between steps. The various
$\tau$’s have a probability density function $W(\tau)$ (the waiting time PDF). This PDF is normalised,

$$\int_0^{\infty} W(\tau)\, d\tau = 1 , \tag{28}$$

and the average waiting time spent between steps is

$$\bar{\tau} = \int_0^{\infty} \tau\, W(\tau)\, d\tau . \tag{29}$$

Motivated by the experiments of Cardoso et al., we entertain the notion that $\bar{\tau}$ is infinite because the
integral in (29) diverges. For example, suppose that for large $\tau$, $W(\tau) \sim \tau^{-\mu}$. Then $\bar{\tau} = \infty$ if $\mu < 2$.

However, if we only observe a finite number of steps, then we do not sample the entire density
$W(\tau)$. Specifically, suppose that after $N$ steps, we have experienced pauses of duration $\tau_1, \tau_2, \ldots, \tau_N$.
We want to estimate the likely value of $\tau_{\max}(N) \equiv \max\{\tau_1, \tau_2, \ldots, \tau_N\}$. The quantity $\tau_{\max}(N)$ is
useful because we can argue that the structure of $W(\tau)$ for $\tau > \tau_{\max}(N)$ cannot be significant for the
displacement after $N$ steps.


To determine $\tau_{\max}(N)$, we turn to probability theory. Consider a random variable $\theta$ uniformly
distributed in the interval $[0,1]$. That is, the PDF of $\theta$ is $P(\theta) = 1$ if $0 < \theta < 1$ and $P(\theta) = 0$
otherwise. Suppose we take $N$ samples, $\theta_1, \ldots, \theta_N$, and define $\theta_{\min}(N) \equiv \min\{\theta_1, \ldots, \theta_N\}$. In this
simple case it is plausible that $\theta_{\min} \sim N^{-1}$ as $N \to \infty$.
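
The plausibility argument can be made slightly more explicit. As a short added check, assuming the $N$ samples are independent, the probability that every sample exceeds $s$ is $(1 - s)^N$, so

$$\mathrm{Prob}\{\theta_{\min} > s\} = (1 - s)^N , \qquad \langle \theta_{\min} \rangle = \int_0^1 (1 - s)^N\, ds = \frac{1}{N + 1} ,$$

which behaves as $N^{-1}$ when $N \to \infty$.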

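Now the trick is to use $\theta$ to represent $\tau$: we write $\theta = \tau^p$, and adjust $p$ so that the power-law tail
of $W(\tau) \sim \tau^{-\mu}$ corresponds to the simple structure of $P(\theta) = 1$. In fact,

$$P(\theta) = W(\tau) \left| \frac{d\tau}{d\theta} \right| \qquad \Rightarrow \qquad 1 \sim \tau^{1 - \mu - p} , \tag{30}$$

or $p = 1 - \mu$. Because the minimum value of $\theta$ maps to the maximum value of $\tau$, it follows that

$$\tau_{\max}(N) \sim N^{1/(\mu - 1)} . \tag{31}$$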

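Now we return to (29) to estimate the effective average pause time after $N$ pauses:

$$\tau_{\rm eff} = \int_0^{\tau_{\max}} \tau\, W(\tau)\, d\tau \sim \tau_{\max}^{2 - \mu} \sim N^{(2 - \mu)/(\mu - 1)} . \tag{32}$$

It is also plausible that the total time $t$ spent on this random walk is given by

$$t \sim N \tau_{\rm eff} . \tag{33}$$

Combining (31), (32) and (33) yields the following scaling relationships:

$$N \sim t^{\mu - 1} , \qquad \tau_{\rm eff} \sim t^{2 - \mu} , \qquad \tau_{\max} \sim t . \tag{34}$$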

The  final  relation  is  worthy  of  comment:  it  implies  a  form  of  self-similarity  of  the  random  walk. 

To conclude, the total displacement of our random walk is proportional to $\sqrt{N}$. But, with the
random pauses, the scaling against time has been altered to

$$\text{RMS displacement} \propto \sqrt{N} \sim t^{(\mu - 1)/2} . \tag{35}$$

This theory can be used to interpret the experiment of Cardoso et al.: because the RMS displacement
grows as $t^{1/3}$ it follows that $\mu \approx 5/3$. Cardoso et al. successfully tested this prediction by measuring
the PDF of trapping times inside vortices.
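
The scaling $N \sim t^{\mu - 1}$ can be tested with a small Monte Carlo experiment. The sketch below is an illustration rather than anything from the lectures: it assumes a pure power-law waiting-time PDF, $W(\tau) = (\mu - 1) \tau^{-\mu}$ for $\tau \geq 1$, sampled by inverting its cumulative distribution, and it uses $\mu = 5/3$. The function names, the number of walkers and the observation times are all hypothetical choices.

```python
import math
import random


def count_steps(t_end, mu, rng):
    """Number of pauses completed by time t_end when the waiting-time PDF
    is the assumed power law W(tau) = (mu - 1) * tau**(-mu) for tau >= 1."""
    t, n = 0.0, 0
    while True:
        # Inverse-CDF sampling: 1 - rng.random() lies in (0, 1].
        tau = (1.0 - rng.random()) ** (-1.0 / (mu - 1.0))
        if t + tau > t_end:
            return n
        t += tau
        n += 1


def mean_steps(t_end, mu, walkers, rng):
    """Average number of completed pauses over an ensemble of walkers."""
    return sum(count_steps(t_end, mu, rng) for _ in range(walkers)) / walkers


mu = 5.0 / 3.0
rng = random.Random(1)
times = [1e2, 1e3, 1e4]
means = [mean_steps(t, mu, 2000, rng) for t in times]

# Log-log slope of N against t between the first and last observation times.
slope = (math.log(means[-1]) - math.log(means[0])) / (
    math.log(times[-1]) - math.log(times[0])
)
print("mean N:", means)
print("measured slope:", slope, " predicted mu - 1:", mu - 1.0)
```

The measured slope should lie close to $\mu - 1 = 2/3$, and since the RMS displacement is proportional to $\sqrt{N}$ this corresponds to growth like $t^{1/3}$.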

4  Stirring  and  mixing 

4.1  Coffee  and  cream 

Appealing to the everyday experience of mixing cream into coffee, Eckart [5] argued that the homogenization
of two fluids occurs in three stages. The distinction between the stages is the value of the
concentration  gradient  averaged  over  the  domain. 

Initial :  there  are  distinct  interfaces  separating  globules  of  cream  and  coffee.  Within  each  globule,  the 
concentration  of  cream  is  nearly  constant  and  the  concentration  gradient  is  close  to  zero.  There 
is  a  very  large  concentration  gradient  between  regions  of  coffee  and  cream.  But  the  interfaces 
between  coffee  and  cream  are  small  in  number  and  not  of  great  area,  so  the  average  gradient  in 
the  coffee  mug  is  small. 

Stirring :  the  cream  is  mechanically  swirled  and  folded,  and  molecular  diffusion  is  unimportant. 
During  this  second  stage  the  concentration  gradients  increase. 


Figure 5: Solution of $c_t + (1 - r^2) c_\theta = (8 \times 10^{-4}) \nabla^2 c$. The initial condition is $c(x, y, 0) = x$.


Mixing :  the  gradients  suddenly  disappear  and  the  fluid  becomes  homogeneous;  molecular  diffusion 
is  responsible  for  the  sudden  mixing. 

In  a  chemical  reaction,  molecules  of  different  species  must  come  into  contact  for  the  reaction  to  occur. 
Thus,  when  the  species  are  initially  separated,  the  reaction  will  not  begin  until  the  final  mixing  stage 
is  reached.  In  this  sense  there  is  an  important  distinction  between  coarse-grained  homogenization, 
occurring solely as a result of stirring, and mixing at the molecular scale.

To illustrate these concepts figure 5 shows a solution of the advection diffusion equation

$$c_t + (1 - r^2)\, c_\theta = \kappa \nabla^2 c , \qquad c(r, \theta, 0) = r \sin \theta , \tag{36}$$

where $\kappa = 8 \times 10^{-4}$. A particle at a distance $r$ from the origin completes a rotation in a time
$2\pi / (1 - r^2)$. Thus particles at smaller values of $r$ will overtake particles at larger values of $r$ and so
the concentration is twisted into spirals by differential advection (stirring).

The increase in gradient during the stirring phase is evident in the figure. But at approximately $t = 16$, mixing starts to dominate, and diffusion rapidly reduces the average gradient. From the initial condition, an estimate of the time it would take unassisted diffusion to homogenize the fluid is $T_D \sim 1/\kappa = 1250$. It is only through the initial process of stirring that the concentration gradient is amplified or, alternatively, that the spirals are stretched out so that small diffusion homogenizes the tracer at $t = 32 \ll T_D$.
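The growth of the gradient during the stirring phase can be checked in the diffusionless limit. The following is a minimal sketch, not the computation behind figure 5: it assumes $\kappa = 0$, so that the initial condition in (36) is simply carried around at angular velocity $1 - r^2$, giving $c = r\sin[\theta - (1 - r^2)t]$. The function names and the finite-difference step are illustrative choices.

```python
import math

def tracer(r, theta, t):
    """Diffusionless (kappa = 0) solution of (36): c = r sin(theta - (1 - r^2) t)."""
    return r * math.sin(theta - (1.0 - r * r) * t)

def mean_square_gradient(r, t, n_theta=720, h=1e-5):
    """Average of |grad c|^2 around the circle of radius r, by central differences."""
    total = 0.0
    for i in range(n_theta):
        theta = 2.0 * math.pi * i / n_theta
        # Radial and azimuthal derivatives; the gradient is (c_r, c_theta / r).
        c_r = (tracer(r + h, theta, t) - tracer(r - h, theta, t)) / (2.0 * h)
        c_theta = (tracer(r, theta + h, t) - tracer(r, theta - h, t)) / (2.0 * h)
        total += c_r ** 2 + (c_theta / r) ** 2
    return total / n_theta

if __name__ == "__main__":
    for t in (0.0, 4.0, 8.0, 16.0):
        print(t, mean_square_gradient(0.7, t))
```

Under these assumptions the azimuthal average works out to $1 + 2r^4t^2$, so stirring alone makes the mean square gradient grow quadratically in time until diffusion intervenes.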




Figure 6: The straining flow described by the streamfunction $\psi = -\alpha x y$. The figure shows how a circular patch of tracer is stretched out along the $x$-axis by the hyperbolic strain. If $\kappa = 0$ the major axis of the ellipse grows as $\exp(\alpha t)$ and the minor axis reduces as $\exp(-\alpha t)$ so that the area remains constant.


4.2  A  straining  flow 

A simple example of a two-dimensional flow which amplifies concentration gradients is the hyperbolic strain shown in figure 6. The streamfunction is $\psi = -\alpha x y$ and so the advection-diffusion equation is

$$c_t + \alpha x\,c_x - \alpha y\,c_y = \kappa \nabla^2 c\,. \qquad (37)$$

Notice the dimensions here: $\alpha^{-1}$ has dimensions "time" and $\kappa$ has dimensions (length)$^2$/(time). From those two quantities we can build a combination with the dimensions of (length):

$$\ell \equiv \sqrt{\frac{\kappa}{\alpha}}\,. \qquad (38)$$

The length $\ell$ will appear prominently in the sequel.

We begin our discussion of hyperbolic strain by obtaining a solution in which $c$ is independent of both $x$ and $t$. In this special case the solution of (37) is

$$c_y = A \exp\left(-\frac{y^2}{2\ell^2}\right), \qquad c(x, \pm\infty, t) = \pm\sqrt{\frac{\pi}{2}}\,A\ell\,. \qquad (39)$$

The concentration profile is the error function shown in figure 7. The solution shows the steady state balance between advection and diffusion: with $\sqrt{\pi/2}\,A\ell = 1$, the concentration $c$ changes smoothly between $c = +1$ as $y \to +\infty$ and $c = -1$ as $y \to -\infty$. The transition occurs in a front of width $\ell$.
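As a check on (39), the sketch below verifies numerically that the error-function profile balances advection against diffusion. It assumes the normalization $c \to \pm 1$, for which (39) integrates to $c = \mathrm{erf}(y/\sqrt{2}\,\ell)$ with $\ell = \sqrt{\kappa/\alpha}$; the function names, step size and parameter values are illustrative.

```python
import math

def front_profile(y, ell):
    """Steady front c(y) = erf(y / (sqrt(2) ell)), normalized so that c -> +/-1."""
    return math.erf(y / (math.sqrt(2.0) * ell))

def steady_residual(y, alpha, kappa, h=1e-3):
    """Residual -alpha*y*c_y - kappa*c_yy of the steady, x-independent form of (37)."""
    ell = math.sqrt(kappa / alpha)
    c_plus = front_profile(y + h, ell)
    c_mid = front_profile(y, ell)
    c_minus = front_profile(y - h, ell)
    c_y = (c_plus - c_minus) / (2.0 * h)               # central first derivative
    c_yy = (c_plus - 2.0 * c_mid + c_minus) / (h * h)  # central second derivative
    return -alpha * y * c_y - kappa * c_yy

if __name__ == "__main__":
    for y in (-1.0, -0.3, 0.0, 0.3, 1.0):
        print(y, front_profile(y, 0.5), steady_residual(y, alpha=2.0, kappa=0.5))
```

The residual vanishes to within the finite-difference error, which is the statement that compression by the strain exactly offsets diffusive spreading across the front.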

We can give an intuitive discussion of how the steady state profile in figure 7 is established as the solution of an initial value problem. Suppose we had started with an initial condition such as $c(x, y, 0) = \mathrm{sgn}(y)$, in which the transition between $c = -1$ and $c = +1$ occurs in a distance much less than $\ell$. Then the discontinuity in $c$ initially diffuses freely, growing like $\sqrt{\kappa t}$. Once the width of the front becomes comparable to $\ell$, that is when

$$\sqrt{\kappa t} \sim \ell\,, \quad \Rightarrow \quad t \sim \alpha^{-1}\,, \qquad (40)$$

the spread is arrested and the steady state in figure 7 is established.

Figure 7: The time independent error function solution to equation (37).

On the other hand, we can also consider an initial condition in which the transition between $c = 1$ and $c = -1$ occurs on a scale $L_0 \gg \ell$. In this case the front is initially compressed by the hyperbolic strain so that the width is reduced exponentially, $L = L_0 \exp(-\alpha t)$. Because $L_0 \gg \ell$ the diffusion is unimportant until the exponential reduction in scale reaches $\ell$. That is,

$$L_0 e^{-\alpha t} \sim \ell\,, \quad \Rightarrow \quad t \sim \alpha^{-1} \ln(L_0/\ell)\,. \qquad (41)$$

These considerations illustrate the fundamental importance of $\ell$ as the scale on which advection and diffusion come into balance.
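The scaling estimates (38), (40) and (41) are easy to tabulate. The following is a rough sketch only: the estimates are order-of-magnitude statements valid for fronts much narrower or much wider than $\ell$, and the function names and the sample values of $\kappa$, $\alpha$ and $L_0$ are hypothetical.

```python
import math

def balance_length(kappa, alpha):
    """Length (38) on which strain and diffusion balance: ell = sqrt(kappa / alpha)."""
    return math.sqrt(kappa / alpha)

def adjustment_time(initial_width, kappa, alpha):
    """Order-of-magnitude time for a front to reach the equilibrium width ell.

    Narrow fronts spread diffusively and are arrested after about 1/alpha, as in (40);
    wide fronts are compressed exponentially and need ln(L0/ell)/alpha, as in (41).
    The estimate is not meaningful when initial_width is close to ell.
    """
    ell = balance_length(kappa, alpha)
    if initial_width <= ell:
        return 1.0 / alpha
    return math.log(initial_width / ell) / alpha

if __name__ == "__main__":
    kappa, alpha = 1e-4, 1.0   # hypothetical values
    print("ell =", balance_length(kappa, alpha))
    for width in (1e-4, 1.0, 100.0):
        print(width, adjustment_time(width, kappa, alpha))
```

Because the dependence on $L_0$ is only logarithmic, even a front ten thousand times wider than $\ell$ reaches the balance scale in under ten strain times.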

4.3  Lagrangian  coordinates:  a  simple  example 

The hyperbolic strain also provides a painless illustration of some mathematical techniques which can be used in more complicated problems. We begin by considering the solution of (37) with $\kappa = 0$. With no diffusion $c$ is tied to fluid particles. The position of a fluid particle is related to its initial position, $(a, b)$, by solving the differential equations

$$(\dot x, \dot y) = \alpha (x, -y)\,, \quad \Rightarrow \quad (x, y) = (e^{\alpha t} a,\; e^{-\alpha t} b)\,. \qquad (42)$$

The solution of (37) can now be obtained by arguing that the particle which is at the point $(x, y)$ at time $t$ began at $(a, b) = (\exp(-\alpha t)\,x, \exp(\alpha t)\,y)$ at $t = 0$. Because the particle carries the concentration it follows that the solution of (37) as an initial value problem is

$$c(x, y, t) = c_0\left[\exp(-\alpha t)\,x,\; \exp(\alpha t)\,y\right], \qquad (43)$$

where $c_0(x, y)$ is the initial condition. The philosophy of this method is that we care where fluid particles come from, but not where they are going to.
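The "where did the particle come from" argument translates directly into code. This is a minimal sketch of (42) and (43) for $\kappa = 0$; the Gaussian initial condition and all function names are assumptions made for illustration.

```python
import math

def particle_position(a, b, t, alpha):
    """Position (42) at time t of the particle that started at (a, b)."""
    return a * math.exp(alpha * t), b * math.exp(-alpha * t)

def advected_tracer(c0, x, y, t, alpha):
    """Diffusionless solution (43): evaluate the initial condition at the
    Lagrangian label (a, b) = (exp(-alpha t) x, exp(alpha t) y)."""
    return c0(x * math.exp(-alpha * t), y * math.exp(alpha * t))

def gaussian_blob(x, y):
    """Hypothetical initial condition, used only for illustration."""
    return math.exp(-(x * x + y * y))

if __name__ == "__main__":
    alpha, t = 0.8, 1.5
    a, b = 0.3, -0.2
    x, y = particle_position(a, b, t, alpha)
    # The concentration carried by the particle equals its initial value.
    print(gaussian_blob(a, b), advected_tracer(gaussian_blob, x, y, t, alpha))
```

Evaluating the solution at a point never requires integrating trajectories forward; the inverse map to the initial label is all that is needed.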

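The solution above seems to rely crucially on the restriction that $\kappa = 0$. But now look what happens if we use the Lagrangian coordinates $(a, b)$ in (42) as new independent variables in (37). As an accounting device, it is comforting to define $\tau = t$ and consider $\partial_\tau$ as the time derivative with $(a, b)$ fixed. Thus the transformation rules are

$$(\partial_x, \partial_y) = \left(\frac{\partial a}{\partial x}, \frac{\partial a}{\partial y}\right)\partial_a + \left(\frac{\partial b}{\partial x}, \frac{\partial b}{\partial y}\right)\partial_b = \left(e^{-\alpha t}\,\partial_a,\; e^{\alpha t}\,\partial_b\right) \qquad (44)$$

and

$$\partial_t = \frac{\partial \tau}{\partial t}\,\partial_\tau + \frac{\partial a}{\partial t}\,\partial_a + \frac{\partial b}{\partial t}\,\partial_b = \partial_\tau - \alpha a\,\partial_a + \alpha b\,\partial_b\,. \qquad (45)$$

The punchline is that

$$\partial_t + \alpha x\,\partial_x - \alpha y\,\partial_y = \partial_\tau\,, \qquad (46)$$

which shows that the change to a Lagrangian description makes the convective derivative trivial. Substituting the transformations above into (37) gives: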


$$c_\tau = \kappa e^{-2\alpha\tau}\,c_{aa} + \kappa e^{2\alpha\tau}\,c_{bb}\,. \qquad (47)$$

Naturally, if $\kappa = 0$, we recover our earlier solution in (43). But even if $\kappa \neq 0$ it is often easier to solve (47) than the Eulerian form in (37). For example, Fourier transforming (47), with $(\partial_a, \partial_b) \to i(p, q)$, gives a simple ordinary differential equation in time.
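To make the last remark concrete, consider a single Fourier mode $\hat c(\tau)\exp[i(pa + qb)]$. Equation (47) then reduces to $\hat c_\tau = -\kappa\,(p^2 e^{-2\alpha\tau} + q^2 e^{2\alpha\tau})\,\hat c$. The sketch below integrates this ODE with a classical Runge-Kutta step and compares with the closed form obtained by integrating the rate in time. The closed form is worked out here rather than quoted from the text, and the function names and parameter values are illustrative.

```python
import math

def decay_rate(tau, p, q, kappa, alpha):
    """Rate in the ODE obtained from (47) for one Fourier mode exp(i(p a + q b))."""
    return kappa * (p * p * math.exp(-2.0 * alpha * tau) + q * q * math.exp(2.0 * alpha * tau))

def mode_amplitude_rk4(tau_end, p, q, kappa, alpha, steps=2000):
    """Integrate d(c_hat)/d(tau) = -decay_rate * c_hat from c_hat(0) = 1 with RK4."""
    dt = tau_end / steps
    c_hat, tau = 1.0, 0.0
    for _ in range(steps):
        k1 = -decay_rate(tau, p, q, kappa, alpha) * c_hat
        k2 = -decay_rate(tau + 0.5 * dt, p, q, kappa, alpha) * (c_hat + 0.5 * dt * k1)
        k3 = -decay_rate(tau + 0.5 * dt, p, q, kappa, alpha) * (c_hat + 0.5 * dt * k2)
        k4 = -decay_rate(tau + dt, p, q, kappa, alpha) * (c_hat + dt * k3)
        c_hat += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        tau += dt
    return c_hat

def mode_amplitude_exact(tau, p, q, kappa, alpha):
    """Closed form: exp of minus the time integral of decay_rate from 0 to tau."""
    along_a = p * p * (1.0 - math.exp(-2.0 * alpha * tau))
    along_b = q * q * (math.exp(2.0 * alpha * tau) - 1.0)
    return math.exp(-kappa * (along_a + along_b) / (2.0 * alpha))

if __name__ == "__main__":
    print(mode_amplitude_rk4(2.0, 1.0, 1.0, 0.01, 1.0))
    print(mode_amplitude_exact(2.0, 1.0, 1.0, 0.01, 1.0))
```

The $q^2 e^{2\alpha\tau}$ term dominates at late times: structure along the compressed direction is what diffusion eventually destroys.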

It is instructive to use the method above to solve (37) with the initial condition

$$c(x, y, 0) = \delta(x)\,\delta(y)\,. \qquad (48)$$

Physically, this is a spot of dye released in a straining flow. When $\alpha t \ll 1$ the spot spreads diffusively, with a diameter which grows as $\sqrt{\kappa t}$. However when $\alpha t \sim 1$ the diameter of the spot becomes comparable to $\ell = \sqrt{\kappa/\alpha}$, and then the spot stops expanding against the compressive direction of the strain. However the spot continues to stretch along the extensive direction. Thus, when $\alpha t \gg 1$, the spot becomes a filament with an equilibrium width of order $\ell$ and an exponentially growing length. These intuitive arguments are supported by the exact solution:


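$$c(x, y, t) = \frac{1}{4\pi f g} \exp\left[-\frac{x^2}{4 f^2} - \frac{y^2}{4 g^2}\right], \qquad (49)$$

where $f(t)$ and $g(t)$ are

$$f^2 = \frac{\kappa}{2\alpha}\left(e^{2\alpha t} - 1\right), \qquad g^2 = \frac{\kappa}{2\alpha}\left(1 - e^{-2\alpha t}\right). \qquad (50)$$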

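Notice that the peak concentration, $1/(4\pi f g)$, ultimately decreases like $e^{-\alpha t}$: the length $f$ grows as $e^{\alpha t}$ while the width $g$ approaches the constant $\sqrt{\kappa/2\alpha}$.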
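The limiting behaviours of the spot can be checked by evaluating the widths directly. The sketch below assumes the Gaussian form $c = \exp[-x^2/4f^2 - y^2/4g^2]/(4\pi f g)$ for (49), which is what the prefactor $1/(4\pi f g)$ and the definitions in (50) imply; the function names and parameter values are illustrative.

```python
import math

def spot_widths(t, kappa, alpha):
    """Return (f, g) from (50) for t > 0."""
    f2 = kappa / (2.0 * alpha) * (math.exp(2.0 * alpha * t) - 1.0)
    g2 = kappa / (2.0 * alpha) * (1.0 - math.exp(-2.0 * alpha * t))
    return math.sqrt(f2), math.sqrt(g2)

def spot_concentration(x, y, t, kappa, alpha):
    """Concentration (49) of a unit amount of dye released at the origin at t = 0."""
    f, g = spot_widths(t, kappa, alpha)
    gaussian = math.exp(-x * x / (4.0 * f * f) - y * y / (4.0 * g * g))
    return gaussian / (4.0 * math.pi * f * g)

if __name__ == "__main__":
    kappa, alpha = 0.02, 1.0   # hypothetical values
    for t in (0.01, 0.1, 1.0, 5.0, 6.0):
        f, g = spot_widths(t, kappa, alpha)
        print(t, f, g, spot_concentration(0.0, 0.0, t, kappa, alpha))
```

For $\alpha t \ll 1$ both $f^2$ and $g^2$ reduce to $\kappa t$, the isotropic diffusive spot; for $\alpha t \gg 1$ the width $g$ saturates while the peak value falls by a factor $e^{-\alpha}$ per unit time.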

4.4  An  example  of  sudden  mixing 

As a final look at the hyperbolic straining flow, we note that a solution of (37) is

$$c(x, y, t) = A(t) \cos(k e^{-\alpha t} x) \cos(k e^{\alpha t} y)\,, \qquad (51)$$

where

$$A(t) = \exp\left[-\ell^2 k^2 \sinh 2\alpha t\right]. \qquad (52)$$

One route to this exact solution is to look for separable solutions of (47), and then transform back to the Eulerian coordinates (e.g., Young, Rhines & Garrett, 1982).

The mean value of the square of the concentration gradient varies with time as:

$$\{\nabla c \cdot \nabla c\} = \cosh(2\alpha t) \exp\left[-2\ell^2 k^2 \sinh(2\alpha t)\right], \qquad (53)$$

where $\{\,\}$ denotes an average over a large area. $\{\nabla c \cdot \nabla c\}$ is plotted in figure 8 for various values of $k\ell$. Recalling Eckart's description of stirring as increasing the concentration gradient, and mixing as decreasing the concentration gradient, we can see the transition between the two phases occurs at the peaks of the various curves. If $k\ell \ll 1$, then the time it takes to reach this peak is given by $t_*$, where

$$\alpha t_* \approx -\ln(k\ell)\,. \qquad (54)$$

Once again, this is the time taken for the exponential factor $e^{-\alpha t}$ to reduce the initial length of the tracer field, $k^{-1}$, down to the length $\ell$ on which strain and diffusion balance.
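The estimate (54) can be compared with the exact location of the peak of (53). Setting the time derivative of (53) to zero gives $\sinh(2\alpha t) = 2(k\ell)^2\cosh^2(2\alpha t)$, a quadratic in $\sinh(2\alpha t)$. This step is worked out here rather than taken from the text, and the function names are illustrative.

```python
import math

def mean_square_gradient(alpha_t, k_ell):
    """Normalized mean square gradient (53) as a function of alpha*t and k*ell."""
    return math.cosh(2.0 * alpha_t) * math.exp(-2.0 * k_ell ** 2 * math.sinh(2.0 * alpha_t))

def peak_time(k_ell):
    """Value of alpha*t at the maximum of (53), or None if there is no maximum.

    With s = sinh(2 alpha t) and eps = (k ell)^2 the stationarity condition is
    2 eps s^2 - s + 2 eps = 0. The larger root is the maximum; the smaller root
    is a shallow early minimum.
    """
    eps = k_ell ** 2
    disc = 1.0 - 16.0 * eps ** 2
    if disc <= 0.0:
        return None  # k*ell >= 1/2: the gradient decays monotonically
    s = (1.0 + math.sqrt(disc)) / (4.0 * eps)
    return 0.5 * math.asinh(s)

def peak_time_estimate(k_ell):
    """Asymptotic estimate (54), intended for k*ell << 1."""
    return -math.log(k_ell)

if __name__ == "__main__":
    for k_ell in (0.3, 0.1, 0.01):
        print(k_ell, peak_time(k_ell), peak_time_estimate(k_ell))
```

The real roots disappear when $k\ell \geq 1/2$, which is the threshold beyond which the mean square gradient has no growth phase at all.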
Figure 8: The mean square of the concentration gradient. If $k\ell \ll 1$ then the concentration gradient grows until $t = t_*$ in (54) and then decreases precipitously. If $k\ell > 1/2$ then diffusion always overpowers strain and the mean square gradient decreases monotonically to zero.


4.5  A  Welander  scrapbook 

Stirring was beautifully illustrated in a 1955 paper of Welander's [12]. This paper is notable also because of its discussion of the importance of coarse-grained averages. Figures 9, 10 and 11, reproduced from Welander (1955), show that simple velocity fields produce spectacular distortion of passive scalars.

In figures 9, 10 and 11, some dimensions of the scalar blob are stretched out while other dimensions are contracted. Batchelor (1952) [1] argued that in turbulent flows random stretching results in an exponential growth of the separation between two initially adjacent fluid elements. That is, if we consider two material elements separated by a distance $s_0$ which is much less than the scale of the velocity field, then Batchelor argues that the separation grows as

$$s \sim s_0 e^{\gamma t}\,. \qquad (55)$$

The time-scale $\gamma^{-1}$ is analogous to $\alpha^{-1}$ in (37), though in figures 10 and 11 the exponential straining is driven by a random and unsteady velocity, rather than the simple hyperbolic field in figure 6. Note particularly that the exponential law in (55) is valid only until the separation $s(t)$ becomes comparable to the length scale over which the velocity varies.
Figure 9: Welander's numerical solution illustrating differential advection by a simple velocity field. A checkerboard pattern is deformed by a quasigeostrophic barotropic solution which models atmospheric flow at the 500 mb level. The initial streamline pattern is shown at the top and the subsequent figures are at 6 hours, 12 hours, 24 hours and 36 hours, respectively. Notice that each square of the checkerboard maintains constant area as it deforms.
Lecture 2: Diffusion


1  Introduction 

Perhaps  you  have  heard  that  turbulence  is  the  most  difficult  problem  in  fluid  mechanics  and,  according 
to  some,  the  greatest  unsolved  problem  in  physics.  One  indication  of  the  difficulty  is  that  it  is 
impossible  to  give  a  satisfactory  definition  of  a  “turbulent  flow” .  But  everyone  agrees  that  one  property 
of  turbulence  is  greatly  enhanced  transport  of  passive  contaminants.  For  example,  relying  only  on 
molecular  agitation,  a  dissolved  sugar  molecule  takes  years  to  diffuse  across  a  coffee  cup,  and  on  that 
time-scale  the  coffee  will  surely  evaporate.  With  a  spoon  the  coffee  drinker  can  create  eddies  that 
transport  dissolved  sugar  throughout  the  cup  in  less  than  one  second.  This  is  an  example  of  eddy 
diffusivity. 

Fluid mechanics textbooks often justify eddy diffusivity by appealing to an analogy between turbulent eddies and molecular diffusion; perhaps this notion originates with G.I. Taylor's 1915 paper entitled "Eddy motion in the atmosphere" [4]. In any event, the molecular analogy, supplemented with some hand-waving, leads to the notion of an eddy diffusivity and for many scientists this is the end of the turbulence problem.

Our  goal  in  this  lecture  is  to  explain  very  explicitly  the  assumptions  behind  Taylor’s  “proof  by 
analogy”  and  to  illustrate  the  interesting  points  at  which  the  analogy  fails.  We  will  pursue  this 
program  by  working  with  some  very  simple  model  flows  for  which  analytic  results,  such  as  expressions 
for  the  eddy  diffusivity,  are  available.  As  you  will  soon  see,  these  model  flows  do  not  greatly  resemble 
turbulence,  but  then  neither  does  molecular  motion!  Our  excuse  is  that  soluble  examples  are  always 
diverting  and  educational. 


2  The  renovating  wave  model 

2.1  A  recipe  for  constructing  soluble  models 

The main problem in analyzing transport is solving the differential equations which describe the motion of particles in even very simple flows. However there is a class of flows for which this task is trivial. These are steady and unidirectional flows, such as $u = \sin y$. A particle which starts at $(a, b)$ at $t = 0$ finds itself at $(a + \tau \sin b,\, b)$ at $t = \tau$. This is dull, but it becomes more interesting if at intervals of $\tau$ we "renovate" the flow by randomly picking a new direction along which the velocity acts. In this way we can construct a sequence of iterated random maps and calculate diffusivities, and other statistical properties, by averaging the exact solution. I learned of this trick from the literature on dynamo theory. The book Stretch, Twist, Fold: the Fast Dynamo is highly recommended for students interested in all aspects of stirring and mixing [1].

2.2  The  renovating  wave  (RW)  model 

As a particular example we now formulate the renovating wave (RW) model. We divide the time axis into intervals

$$I_n \equiv \{t : (n - 1)\tau < t < n\tau\}\,, \qquad (1)$$

and in each interval we apply a velocity, $\mathbf{u}_n = (-\partial_y \psi_n, \partial_x \psi_n)$, derived from the streamfunction

$$\psi_n(x, y, t) = k^{-1} U \cos\left[k \cos\theta_n\, x + k \sin\theta_n\, y + \varphi_n\right], \qquad (2)$$

where $\theta_n$ and $\varphi_n$ are independent random variables uniformly distributed in the interval $[-\pi, \pi]$. Thus in each $I_n$ there is a steady, unidirectional velocity with sinusoidal profile (a single wave). There is sudden and complete loss of all information about the past velocity at $t = n\tau$ because at these instants we "renovate" the velocity by picking new random angles $\theta$ and $\varphi$. (This means that the velocity correlation function, $C(t)$, is zero if $t > \tau$.)

The renovating wave model can be nondimensionalized by using $k^{-1}$ as a unit of length and $1/(Uk)$ as a unit of time. With this choice, the model contains a single dimensionless parameter, $\tau_* \equiv \tau k U$, which is a measure of the persistence of the motion. Much of the literature on random advection-diffusion uses model velocity fields which are $\delta$-correlated in time. We can recover this limit as a special case by taking $\tau_* \to 0$.

Using dimensionless variables, a particle which is at $\mathbf{x}_n = (x_n, y_n)$ at $t_n = n\tau_*$ moves to $\mathbf{x}_{n+1}$ at $t = (n + 1)\tau_*$, where

$$(x_{n+1}, y_{n+1}) = (x_n, y_n) + \tau_* \sin(c_n x_n + s_n y_n + \varphi_n)\,(s_n, -c_n)\,, \qquad (3)$$

with $s_n \equiv \sin\theta_n$ and $c_n \equiv \cos\theta_n$. Thus motion in the renovating wave problem is equivalent to an iterated sequence of random maps.
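The iterated random map (3) takes only a few lines to implement. The following is a minimal sketch in dimensionless variables; the function names and the use of a seeded generator are illustrative choices, not part of the model.

```python
import math
import random

def random_wave(rng):
    """Draw the direction theta_n and phase phi_n of one wave, uniform on [-pi, pi]."""
    return rng.uniform(-math.pi, math.pi), rng.uniform(-math.pi, math.pi)

def apply_wave(x, y, tau_star, theta, phi):
    """Move the point (x, y) through one renovation cycle of the map (3)."""
    c, s = math.cos(theta), math.sin(theta)
    amplitude = tau_star * math.sin(c * x + s * y + phi)
    return x + amplitude * s, y - amplitude * c

def trajectory(x0, y0, tau_star, n_cycles, seed=0):
    """Positions of one particle after 0, 1, ..., n_cycles renovation cycles."""
    rng = random.Random(seed)
    points = [(x0, y0)]
    for _ in range(n_cycles):
        theta, phi = random_wave(rng)
        x, y = points[-1]
        points.append(apply_wave(x, y, tau_star, theta, phi))
    return points

if __name__ == "__main__":
    for point in trajectory(0.0, 0.0, tau_star=1.0, n_cycles=5):
        print(point)
```

Keeping the angles as explicit arguments of `apply_wave` lets the same wave be applied to every point of a tracer patch, which is what a single realization requires; drawing fresh angles for each particle instead samples independent realizations.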

2.3  The  single-particle  diffusivity 

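It is very easy to calculate the diffusivity in the RW model (and much more difficult to interpret the answer). The average of a function of the two random angles $\theta$ and $\varphi$ (suppress the subscript $n$) is defined by

$$\langle f \rangle \equiv \oint \frac{d\theta}{2\pi} \oint \frac{d\varphi}{2\pi}\, f(\theta, \varphi)\,. \qquad (4)$$

Therefore, using (3),

$$\left\langle (x_{n+1} - x_n)^2 \right\rangle = \frac{\tau_*^2}{4}\,. \qquad (5)$$

The computation is trivial if the integral over $\varphi$ is evaluated first.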

In (5), following our previous discussion based on Einstein's derivation of the diffusion equation, we are computing the statistics of dispersion along the $x$-axis. Because the renovating wave model is isotropic, dispersion in the $y$-direction is identical to that in the $x$-direction.

Because all of the waves are independent and identically distributed it follows that after $n$ renovation cycles

$$\left\langle (x_n - x_0)^2 \right\rangle = n\,\frac{\tau_*^2}{4}\,. \qquad (6)$$

But $t = n\tau_*$, and $\langle (x_n - x_0)^2 \rangle = 2Dt$, so that using dimensionless variables the diffusivity is

$$D = \frac{\tau_*}{8}\,. \qquad (7)$$

Sometimes $D$ is referred to as the single-particle diffusivity. "Single-particle" emphasizes that $D$ strictly applies only to the RMS displacement of a particle from its initial position; $D$ contains no information concerning the deformation of a patch of tracer, nor of any other quantity involving correlated motion. Thus, using dimensional variables, the diffusivity in (7) is $D = U^2\tau/8$, which is independent of $k$. Because $D$ is independent of the scale of the wave, even a spatially uniform, but random-in-time velocity (the case $k = 0$), has a single-particle diffusivity.
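The result (7) can be checked by brute force. The sketch below is a Monte Carlo estimate, assuming each particle is followed in its own independent realization of the map (3); the sample sizes, seed and function name are arbitrary choices.

```python
import math
import random

def estimate_diffusivity(tau_star, n_cycles=20, n_particles=20000, seed=0):
    """Monte Carlo estimate of D = <(x_n - x_0)^2> / (2 t), with t = n * tau_star.

    Every particle starts at the origin and sees its own sequence of random waves.
    """
    rng = random.Random(seed)
    sum_sq = 0.0
    for _ in range(n_particles):
        x = y = 0.0
        for _ in range(n_cycles):
            theta = rng.uniform(-math.pi, math.pi)
            phi = rng.uniform(-math.pi, math.pi)
            c, s = math.cos(theta), math.sin(theta)
            amplitude = tau_star * math.sin(c * x + s * y + phi)
            x, y = x + amplitude * s, y - amplitude * c
        sum_sq += x * x
    return sum_sq / n_particles / (2.0 * n_cycles * tau_star)

if __name__ == "__main__":
    for tau_star in (0.5, 1.0, 2.0):
        print(tau_star, estimate_diffusivity(tau_star), tau_star / 8.0)
```

The estimate fluctuates around $\tau_*/8$ at the level of about one percent for these sample sizes, and it says nothing about how a patch of tracer deforms within a single realization.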


Figure 1: Stretching of a small spot, $r \ll 1$ where $r$ is the initial radius of the spot, by a succession of random sinusoidal flows. The dotted circle is the initial spot. (Panels: $t = 1\tau$ to $t = 6\tau$.)

Figure 2: Stretching of a blob with $r = 1$, where $r$ is the initial radius. The dotted circle is the initial patch. (Panels: $t = 1\tau$, $2\tau$, $3\tau$.)

Figure 3: Stretching of a big blob, $r \gg 1$, where $r$ is the initial radius of the blob. The dotted circle representing the initial patch may not be visible beneath the wiggly boundary of the blob.


2.4  Deformation  of  variously  sized  blobs 

To emphasize the importance of understanding more than single-particle diffusivities we take a digression and illustrate how the deformation of an initially circular blob of fluid depends on the blob radius $r$. (Recall that we have used $k^{-1}$ as unit of length; in terms of dimensional variables the relevant nondimensional parameter is $kr$.)

If the initial blob is much smaller than the wavelength of the velocity then on the scale of the blob the velocity profile is a linear function of the coordinates. Because of this simplicity, the first few iterations deform the circular blob into an ellipse which must have the same area as the initial circle. We will see in the next lecture that the major axis of the ellipse grows exponentially while the minor axis shrinks so that the area is fixed. Once the dimensions of the ellipse are comparable to the wavelength of the flow, more complicated deformations occur. Ultimately the blob will be stretched into a folded filament as in figure 1.

The  blob  has  the  same  scale  as  the  velocity  field  if  r  ~  1.  Because  there  is  no  scale  separation 
there  is  no  easy  description  of  the  action  of  the  flow  on  the  blob,  see  figure  2. 

If $r \gg 1$ then we are in the "eddy diffusivity" limit in which the scale of the velocity field is much smaller than the scale of the tracer. This case is shown in figure 3. The action of the waves perturbs the edge of the blob, making it look "fuzzy". In fact, the area is preserved, but the circumference of the blob grows exponentially. We will be discussing this type of problem for the remainder of the lecture.


2.5  The  Lagrangian  correlation  function 

In (7) we gave the diffusivity of particles moving in an ensemble of renovating waves. How do we obtain the Lagrangian velocity autocorrelation function and verify Taylor's formula that

$$D = \int_0^\infty C(t)\, dt\,? \qquad (8)$$


Considering this question, we encounter an annoying technical difficulty: our derivation of (8) assumes that the velocity statistics are stationary. But the renovating wave ensemble, as we defined it back in (1) and (2), is not a stationary stochastic process. This is because with our original definition all members of the ensemble renovate at the same instants $t = \tau$, $t = 2\tau$, etcetera. In order to obtain a stationary process we should initiate different realizations at uniformly distributed points during the renovation cycle. Thus, for realization number $j$, we pick a random time $\tau^{(j)}$ which is uniformly distributed in the interval $[0, \tau]$ and renovate first at $t = \tau^{(j)}$ and then subsequently at $t = \tau^{(j)} + \tau$, $t = \tau^{(j)} + 2\tau$, etcetera. With this new and improved formulation of the RW model the Lagrangian correlation function of $u(t)$ is a "triangular" function:

$$C(t) = \frac{U^2}{4}\left(1 - \frac{t}{\tau}\right) H(\tau - t)\,, \qquad (9)$$

where $H$ is the step function and $U$ is the velocity in (2). The area under this correlation function is $D = U^2\tau/8$.
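The area under the triangular correlation function is simple enough to do by hand, but a numerical quadrature makes the connection with the time integral of $C(t)$ explicit. This is a small sketch with illustrative function names; it takes $C(t) = (U^2/4)(1 - t/\tau)$ for $0 \le t < \tau$ and zero afterwards, and integrates with the trapezoidal rule.

```python
def lagrangian_correlation(t, speed, tau):
    """Triangular correlation function (9) for t >= 0: (U^2/4)(1 - t/tau) H(tau - t)."""
    if t >= tau:
        return 0.0
    return 0.25 * speed ** 2 * (1.0 - t / tau)

def taylor_diffusivity(speed, tau, n=10000):
    """Trapezoidal integral of C(t) over 0 <= t <= 2*tau (C vanishes beyond tau)."""
    upper = 2.0 * tau
    h = upper / n
    total = 0.5 * (lagrangian_correlation(0.0, speed, tau)
                   + lagrangian_correlation(upper, speed, tau))
    for i in range(1, n):
        total += lagrangian_correlation(i * h, speed, tau)
    return total * h

if __name__ == "__main__":
    speed, tau = 2.0, 3.0   # hypothetical dimensional values of U and tau
    print(taylor_diffusivity(speed, tau), speed ** 2 * tau / 8.0)
```

Both numbers agree with the dimensional form of (7), $D = U^2\tau/8$.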


3  The  eddy  diffusion  equation 

3.1  The  ensemble  averaged  Green’s  function 

Now that we have obtained the RW diffusivity in (7) we turn to the derivation of the eddy diffusion equation. For each realization we introduce the Green's function, which is defined by

$$G_t + \mathbf{u} \cdot \nabla G = 0\,, \qquad \text{with} \quad G(\mathbf{x}, \mathbf{x}_0, 0) = \delta(\mathbf{x} - \mathbf{x}_0)\,. \qquad (10)$$

The solution of the problem above is

$$G(\mathbf{x}, \mathbf{x}_0, t) = \delta(\mathbf{x} - \mathbf{x}_t)\,, \qquad (11)$$

where $\mathbf{x}_t$ is the position at time $t$ (in a particular realization of $\mathbf{u}$) of the particle which started at $\mathbf{x}_0$. The ensemble averaged Green's function is

$$g(r, t) = \langle G(\mathbf{x}, \mathbf{x}_0, t) \rangle\,, \qquad r \equiv |\mathbf{x} - \mathbf{x}_0|\,, \qquad (12)$$

where we have assumed that the random velocity is isotropic, homogeneous and stationary so that $g$ can depend only on the distance $r$ and the elapsed time $t$.
Possessing $g(r, t)$, we can then represent the ensemble-averaged solution of the initial value problem

$$c_t + \mathbf{u} \cdot \nabla c = 0\,, \qquad c(\mathbf{x}, 0) = c_0(\mathbf{x})\,, \qquad (13)$$

as the convolution

$$\langle c \rangle(\mathbf{x}, t) = \int c_0(\mathbf{x} - \mathbf{x}')\, g(|\mathbf{x}'|, t)\, d\mathbf{x}'\,. \qquad (14)$$

(We are assuming that the initial condition $c_0$ is the same for all realizations.)

At  this  point,  the  analogy  between  (14)  and  the  master  equation  of  lecture  1  is  obvious.  With 
the  master  equation  in  mind,  we  can  anticipate  that  a  variant  of  Einstein’s  derivation  of  the  diffusion 
equation  can  be  applied  to  (14).  Rather  than  develop  a  general  derivation  we  prefer  to  use  the 
renovating  wave  model  as  a  concrete  illustration  of  how  one  can  obtain  g ,  and  then  pass  from  the 
integral  equation  in  (14)  to  an  approximate  diffusion  equation. 

3.2  The  averaged  Green’s  function  of  the  RW  model 

There are at least two ways of obtaining $g(r)$ in (12) for the RW model: the hard, straightforward way (see the appendix) and the easy, devious way. Let us be devious.

We begin by calculating the probability density function (PDF) of displacements in a single pulse of the RW model. Because the ensemble of velocities is isotropic and homogeneous there is no harm in supposing that the particle is at the origin and the $x$-axis is aligned with the direction of the velocity. That is, put $(x_n, y_n) = (0, 0)$ and $\theta_n = \pi/2$ in (3). Thus, the displacement $r$ produced by a single pulse is

$$x_{n+1} - x_n = \tau_* \sin\varphi_n\,, \qquad \text{and} \qquad r = |x_{n+1} - x_n|\,. \qquad (15)$$

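The PDF of the random variable $r$ can be obtained from the PDF of $\varphi$, that is $P(\varphi) = 1/2\pi$, using the rule for transforming probabilities:

$$P(r) = \sum P(\varphi) \left|\frac{d\varphi}{dr}\right| \quad \Rightarrow \quad P(r) = \frac{2}{\pi}\, \frac{H(\tau_* - r)}{\sqrt{\tau_*^2 - r^2}}\,. \qquad (16)$$

In (16) $H(\tau_* - r)$ is a Heaviside step function which ensures that there are no displacements greater than $\tau_*$. (The sum in (16) is because there are four values of $\varphi$ corresponding to a single value of $r$.) The averaged Green's function is now given by

$$g(r) = \frac{P(r)}{2\pi r} \quad \Rightarrow \quad g(r) = \frac{1}{\pi^2}\, \frac{H(\tau_* - r)}{r \sqrt{\tau_*^2 - r^2}}\,. \qquad (17)$$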

The geometric factor $2\pi r$ is included because $g(r)$ is a concentration. That is, $P(r)\,dr$ is the expected number of particles which fall into the differential annulus between $r$ and $r + dr$, and $g(r)$ is the expected number of particles per area in this same annulus; see figure 4.
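The single-pulse statistics can be sampled directly, in the spirit of figure 4. The sketch below draws $r = \tau_*|\sin\varphi|$ as in (15) and compares the empirical distribution with $P(r) = (2/\pi)/\sqrt{\tau_*^2 - r^2}$ for $r < \tau_*$, whose integral from 0 to $r$ is $(2/\pi)\arcsin(r/\tau_*)$. That cumulative form is worked out here, and the function names, seed and sample size are illustrative.

```python
import math
import random

def sample_displacements(tau_star, n_samples=40000, seed=0):
    """Draw single-pulse displacements r = tau_star |sin(phi)|, phi uniform, as in (15)."""
    rng = random.Random(seed)
    return [tau_star * abs(math.sin(rng.uniform(-math.pi, math.pi)))
            for _ in range(n_samples)]

def displacement_pdf(r, tau_star):
    """P(r) from (16); zero for r >= tau_star."""
    if r >= tau_star:
        return 0.0
    return 2.0 / (math.pi * math.sqrt(tau_star ** 2 - r ** 2))

def displacement_cdf(r, tau_star):
    """Integral of P from 0 to r: (2/pi) arcsin(r / tau_star)."""
    return 2.0 / math.pi * math.asin(min(r, tau_star) / tau_star)

def green_function(r, tau_star):
    """Averaged Green's function (17): particles per unit area, g = P / (2 pi r)."""
    return displacement_pdf(r, tau_star) / (2.0 * math.pi * r)

if __name__ == "__main__":
    samples = sample_displacements(1.0)
    fraction = sum(1 for r in samples if r < 0.5) / len(samples)
    print(fraction, displacement_cdf(0.5, 1.0))
```

One third of the particles land within half the maximum displacement, and the density $g(r)$ is integrably singular both at the origin and at the rim $r = \tau_*$.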

Now that we have the averaged Green's function of a single pulse we can obtain the evolution of the ensemble averaged concentration, $\langle c \rangle$, over many pulses. Because each pulse is independent of the preceding pulses we have

$$\langle c \rangle(\mathbf{x}, (n + 1)\tau_*) = \int \langle c \rangle(\mathbf{x} - \mathbf{x}', n\tau_*)\, g(|\mathbf{x}'|)\, d\mathbf{x}'\,. \qquad (18)$$

The master equation above, with $g(r)$ in (17), is an exact description of the evolution of $\langle c \rangle$ under advection by the RW model.
Figure 4: Displacements of 40,000 particles in independent realizations of the RW model. The left panel shows the final position of particles which all start at the center of the circle after one pulse of the wave. The density of points corresponds to $g(r)$ in (17). The histogram on the right shows the number of particles at a distance $r$ from the center; this is the function $P(r)$ in (16).


3.3  The  diffusion  equation 

With the master equation (18) in hand, we can use Einstein's approximations to obtain the diffusion equation. Using the dimensionless variables of the renovating wave model, we have

$$\langle c \rangle_t \approx \frac{\tau_*}{8}\, \nabla^2 \langle c \rangle\,. \qquad (19)$$

We leave this as a homework exercise and instead we take a different route to (19).

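Because the Fourier transform of a convolution is the product of the Fourier transforms, we can simplify (18) by transforming. The Fourier transform of $f(\mathbf{x})$ is defined here$^1$ as

$$\tilde f(\mathbf{k}) = \int e^{-i\mathbf{k} \cdot \mathbf{x}} f(\mathbf{x})\, d\mathbf{x}\,, \qquad f(\mathbf{x}) = \frac{1}{4\pi^2} \int e^{i\mathbf{k} \cdot \mathbf{x}} \tilde f(\mathbf{k})\, d\mathbf{k}\,. \qquad (20)$$

Applying the transform to (18) we obtain

$$\langle \tilde c \rangle(\mathbf{k}, n\tau_*) = \tilde g(k)^n\, \tilde c_0(\mathbf{k})\,, \qquad k \equiv |\mathbf{k}|\,. \qquad (21)$$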

With a good table of integrals one can discover that the Fourier transform of the averaged Green's function, $g(r)$ in (17), is

$$\tilde g(k) = J_0^2(k\tau_*/2)\,, \qquad (22)$$

where $J_0$ is the Bessel function.

The diffusion equation describes the evolution of large spatial scales, which is the same as small wavenumbers. This means that we simplify (21) by taking $k\tau_*/2 \ll 1$ and using the approximation $J_0(k\tau_*/2) \approx 1 - (k^2\tau_*^2/16)$ to write

$$\langle \tilde c \rangle(\mathbf{k}, n\tau_*) \approx \exp\left\{ n \ln\left[1 - (k^2\tau_*^2/8)\right] \right\} \tilde c_0(\mathbf{k})\,. \qquad (23)$$

$^1$ By denoting the wavenumber with $k$ we are recycling notation used in (2).
But now, since $n = t/\tau_*$ and $\ln[1 - (k^2\tau_*^2/8)] \approx -k^2\tau_*^2/8$, we have

$$\langle \tilde c \rangle(\mathbf{k}, t) = e^{-Dk^2 t}\, \tilde c_0(\mathbf{k})\,, \qquad (24)$$

where, as in (19), $D = \tau_*/8$. Equation (24) is equivalent to the decay of Fourier components given by (19).

This derivation based on Fourier analysis explicitly recognizes that the diffusion approximation is valid only for wavenumbers which satisfy $k\tau_*/2 \ll 1$. This is a precise statement of the scale separation assumption which underlies Einstein's approach.
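The range of validity can be seen by comparing the exact per-pulse factor with its diffusive approximation. The sketch below takes $\tilde g(k) = J_0^2(k\tau_*/2)$, the form consistent with the small-wavenumber expansion used in (23), and evaluates $J_0$ from its standard integral representation with a midpoint rule so that only the standard library is needed. The function names and sample wavenumbers are illustrative.

```python
import math

def bessel_j0(z, n=400):
    """J_0(z) = (1/pi) * integral over [0, pi] of cos(z sin(theta)), midpoint rule."""
    h = math.pi / n
    return sum(math.cos(z * math.sin((i + 0.5) * h)) for i in range(n)) / n

def exact_decay(k, tau_star, n_cycles):
    """Amplitude factor after n_cycles pulses, from (21): J_0(k tau_star / 2)^(2 n)."""
    return bessel_j0(0.5 * k * tau_star) ** (2 * n_cycles)

def diffusive_decay(k, tau_star, n_cycles):
    """Diffusion approximation (24) with D = tau_star / 8 and t = n_cycles * tau_star."""
    return math.exp(-(tau_star / 8.0) * k * k * n_cycles * tau_star)

if __name__ == "__main__":
    for k, n_cycles in ((0.2, 100), (1.0, 10), (3.0, 5)):
        print(k, exact_decay(k, 1.0, n_cycles), diffusive_decay(k, 1.0, n_cycles))
```

At small $k\tau_*$ the two factors agree closely even after many pulses, while at $k\tau_* = 3$ the diffusion approximation overestimates the surviving amplitude by a large factor.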


4  Ensemble  averages  and  single  realizations 

In  hydrodynamic  dispersion,  particles  which  begin  at  neighbouring  points  have  similar  histories  in 
any  single  realization.  Marbled  endpapers  in  old  books  were  produced  by  floating  coloured  inks  on 
water,  stirring  the  surface,  and  then  capturing  the  swirls  by  carefully  lowering  a  sheet  of  paper  onto 
the  inky  film  [3].  This  technique,  probably  originating  in  Persia  in  the  1400s,  presses  hydrodynamic 
correlations  into  the  service  of  art.  Fortunately  for  printers,  and  distressingly  for  statisticians,  a  single 
realization  does  not  resemble  the  blurry  diffusion  equation. 


4.1  Eddy  diffusion  of  a  front 

Figure 5 shows a single realization of the evolution of a "front" under the RW advection process. The front is the sharp border which separates white from dark; initially this line coincides with the $y$-axis. We suppose that the concentration is $c = -1$ for $x < 0$ and $c = +1$ for $x > 0$. Successive pulses of the renovating wave produce an increasingly folded front and the $c = -1$ fluid invades the region $x > 0$ in long thin tendrils. The central question is:

How well is the process in figure 5 described by the diffusion equation?


We know that given many realizations of this process, the long-time ensemble average of these realizations will follow the diffusion equation $\langle c \rangle_t = D \langle c \rangle_{xx}$, with the initial conditions $c(x, 0) = \pm 1$. The solution of this problem is

$$\langle c \rangle = \mathrm{erf}(\eta)\,, \qquad \text{where} \qquad \eta \equiv \frac{x}{2\sqrt{Dt}}\,. \qquad (25)$$


Figure 6 shows this smooth erf solution which, of course, looks nothing like figure 5. If the dark fluid in figure 5 contained radioactive contaminant, and we wanted to estimate the maximum exposure at some value of $x > 0$, then the erf solution in (25) is not useful.

On the other hand, diffusivities are useful if we want to know how many particles are at such-and-such a distance from their initial location. Thus, figure 7 shows a histogram of the positions of 10,000 particles which all start on the line $x = 0$ (the initial front). The Gaussian curve in figure 7 is the corresponding prediction for the PDF of positions which is obtained by solving (19) with the initial condition $\langle c \rangle = \delta(x)$:

$$\langle c \rangle(x, t) = \frac{1}{\sqrt{4\pi D t}} \exp\left(-\frac{x^2}{4Dt}\right). \qquad (26)$$

The histograms converge slowly to this Gaussian prediction. This asymptotic success shows that the diffusion equation correctly predicts the dispersion of particles when $t \gg \tau_*$.

An amusing aspect of the simple problem in figure 5 is that we can easily calculate the RMS fluctuations of $c$ around the ensemble average concentration in (25). Because $c = \pm 1$ we have $\langle c^2 \rangle = 1$. Therefore, defining the fluctuation as

$$c' \equiv c - \langle c \rangle\,, \qquad (27)$$
Figure 5: Evolution of a front under the advection by the RW model. The front initially coincides with the $y$-axis.

Figure 6: Evolution of the ensemble-averaged concentration $\langle c \rangle$ and its variance during the evolution of the front under the RW model. Note how most of the variance is localised around $x = 0$.

Figure 7: At $t = 0$ the front in figure 5 is tagged by placing 10,000 particles along the $y$-axis. The histogram above shows the subsequent $x$-locations of these marker particles as the front is distorted by the RW model with $\tau_* = 1$. The Gaussian curve is given by (26).
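The localisation of the variance noted in the caption of figure 6 can be made quantitative. Since $c = \pm 1$ in every realization, $\langle c^2 \rangle = 1$, and with the definition (27) the variance is $\langle c'^2 \rangle = \langle c^2 \rangle - \langle c \rangle^2 = 1 - \langle c \rangle^2$. The sketch below evaluates this using the erf solution (25); the function names and sample values are illustrative assumptions.

```python
import math

def mean_front(x, t, diffusivity):
    """Ensemble-averaged concentration (25): erf(x / (2 sqrt(D t)))."""
    return math.erf(x / (2.0 * math.sqrt(diffusivity * t)))

def fluctuation_variance(x, t, diffusivity):
    """<c'^2> = 1 - <c>^2, using c = +/-1 in every realization."""
    return 1.0 - mean_front(x, t, diffusivity) ** 2

if __name__ == "__main__":
    diffusivity, t = 1.0 / 8.0, 10.0   # D = tau_star / 8 with tau_star = 1
    for x in (-4.0, -2.0, 0.0, 2.0, 4.0):
        print(x, mean_front(x, t, diffusivity), fluctuation_variance(x, t, diffusivity))
```

At $x = 0$ the variance equals one, as large as it can be: there the ensemble mean $\langle c \rangle = 0$ is a value that no single realization ever takes.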
In a general case, in which there is no statistical symmetry along a particular direction, one can take a single realization and define a coarse-grained or low-pass filtered concentration by:

$$\bar c(\mathbf{x}, t) = \int K(|\mathbf{x} - \mathbf{x}'|)\, c(\mathbf{x}', t)\, d\mathbf{x}'\,; \qquad K(|\mathbf{x}|) \text{ is a filter.} \qquad (30)$$

The hope is that scale separation between the width of the erf and the swirls will ensure that $\bar c \approx \langle c \rangle$. Thus the kernel of the filter, $K$ in (30), might be a Gaussian with a width which is at once much smaller than the thickness of the erf transition zone and much greater than an individual swirl in figure 5.

Scale separation is essential here because the filtering operation defined by the convolution in (30) is not strictly an “average”. Some of the properties we take for granted when we use averages are

$$\langle c' \rangle = 0, \qquad \langle \langle c \rangle \rangle = \langle c \rangle, \qquad \langle \langle a \rangle \langle b \rangle \rangle = \langle a \rangle \langle b \rangle. \tag{31}$$

For the ensemble average, as indicated in (31), everything works.

For a filter, such as the overbar in (30), we can define the fluctuation concentration $c''$ in analogy with (27):

$$c'' = c - \bar{c}. \tag{32}$$

But then $\overline{c''} \neq 0$ and none of the other desiderata in (31) follow. In other words, spatial filtering instead of ensemble averaging introduces a host of extra assumptions which should be carefully assessed (but almost never are).
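The failure of the properties in (31) for a filter is easy to see numerically. The sketch below uses a periodic running mean as a stand-in for the kernel $K$ (an assumption made for illustration; the text suggests a Gaussian) and a synthetic signal with imperfect scale separation. The function name and the test signal are hypothetical.

```python
import math


def box_filter(values, half_width):
    """Periodic running mean over 2*half_width + 1 points (a simple low-pass filter)."""
    n = len(values)
    width = 2 * half_width + 1
    return [
        sum(values[(i + j) % n] for j in range(-half_width, half_width + 1)) / width
        for i in range(n)
    ]


n = 256
# A large-scale signal plus a smaller-scale "swirl" component.
c = [math.sin(2 * math.pi * i / n) + 0.5 * math.sin(2 * math.pi * 16 * i / n)
     for i in range(n)]

c_bar = box_filter(c, half_width=4)
c_fluct = [ci - cbi for ci, cbi in zip(c, c_bar)]   # c'' = c - c_bar, as in (32)
c_fluct_bar = box_filter(c_fluct, half_width=4)     # the filtered fluctuation
c_bar_bar = box_filter(c_bar, half_width=4)         # the filter applied twice

max_fluct_bar = max(abs(v) for v in c_fluct_bar)
max_idempotence_error = max(abs(a - b) for a, b in zip(c_bar_bar, c_bar))
print("max |filtered fluctuation|:", max_fluct_bar)
print("max |filter(filter(c)) - filter(c)|:", max_idempotence_error)
```

Both printed quantities would vanish for an ensemble average. For the filter they are nonzero whenever the signal has energy at scales comparable to the filter width, which is the sense in which scale separation is needed.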

4.3  A  digression:  Brownian  bugs 

I  have  hinted  darkly  at  problems  associated  with  spatial  filters.  These  issues  are  largely  ignored  by 
optimistic  scientists.  The  hope  is  that  scale  separation  justifies  the  application  of  diffusive  closures  to 
the  coarse-grained  version  of  a  single  realization.  Perhaps  a  justification  of  this  optimistic  approach 
is  that  the  alternative  seems  so  repellent.  Nonetheless,  it  is  important  to  realize  that  interpreting 
coarse-grained  distributions  as  ensemble  averages  involves  a  nontrivial  assumption.  The  best  way  of 
exposing  this  assumption  is  to  exhibit  a  problem  in  which  spatial  filters  and  ensemble  averages  are 
very  different.  Accordingly,  as  a  model  of  biological  processes,  we  consider  random  walkers  which 
both  die  and  reproduce.  We  refer  to  these  biological  walkers  as  Brownian  bugs. 

The model is formulated by first placing $N \gg 1$ Brownian bugs randomly in the unit square; the boundary conditions are periodic in both directions. Each cycle of the simulation begins with a random walk step in which bug $k$, located at $\boldsymbol{x}_k = (x_k, y_k)$, is displaced to a new position

$$(x_k', y_k') = \mathrm{mod}\left[(x_k, y_k) + (\delta x_k, \delta y_k);\, 1\right]. \tag{33}$$

In (33), $\delta x_k$ and $\delta y_k$ are Gaussian random variables and the “mod” is to enforce the periodic boundary
conditions  and  keep  each  bug  in  the  unit  square.  After  this  random  walk  step,  the  second  part  of  the 
cycle  is  a  “coin  toss”  which  results  in  either  death  (heads)  or  division  (tails).  When  a  lucky  bug  divides, 
the  offspring  is  placed  at  the  same  position  as  the  parent.  This  cycle  of  random  displacement  and 
random  birth/death  is  repeated  many  times  in  order  to  simulate  many  generations  of  reproduction, 
death  and  dispersion. 
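The rules above translate almost line by line into code. The following is a minimal sketch of one way to implement them, not the MATLAB program used for figure 8. The function names, the population size and the crude clumping diagnostic `occupied_fraction` are assumptions made for illustration.

```python
import random


def brownian_bug_cycle(bugs, step_rms, rng):
    """One cycle: a Gaussian random-walk step with periodic wrap, as in (33), then a fair coin toss."""
    next_generation = []
    for x, y in bugs:
        x = (x + rng.gauss(0.0, step_rms)) % 1.0
        y = (y + rng.gauss(0.0, step_rms)) % 1.0
        if rng.random() < 0.5:
            # Tails: the bug divides and the offspring sits on top of the parent.
            next_generation.append((x, y))
            next_generation.append((x, y))
        # Heads: the bug dies and leaves no descendant.
    return next_generation


def simulate(n_bugs=2000, n_cycles=100, step_rms=0.005, seed=0):
    """Start from uniformly random positions and apply n_cycles of the model."""
    rng = random.Random(seed)
    bugs = [(rng.random(), rng.random()) for _ in range(n_bugs)]
    for _ in range(n_cycles):
        bugs = brownian_bug_cycle(bugs, step_rms, rng)
    return bugs


def occupied_fraction(bugs, n_cells=20):
    """Fraction of the cells of an n_cells x n_cells grid that contain at least one bug."""
    cells = {
        (min(int(x * n_cells), n_cells - 1), min(int(y * n_cells), n_cells - 1))
        for x, y in bugs
    }
    return len(cells) / (n_cells * n_cells)


initial = simulate(n_cycles=0)
later = simulate(n_cycles=100)
print("population:", len(initial), "->", len(later))
print("occupied fraction:", occupied_fraction(initial), "->", occupied_fraction(later))
```

A falling occupied fraction at roughly constant population is a crude signature of the clumps and voids described in the text: the same number of bugs is concentrated into fewer cells.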

The  simulation  shown  in  figure  8  was  implemented  in  MATLAB  using  these  rules.  The  striking 
result  is  that  the  density  of  bugs  spontaneously  develops  large-scale  clumps  and  voids.  Figure  8  seems 
to  show  an  inverse  cascade  of  patch  sizes:  patches  emerge  on  small  scales  in  panel  (b)  and  then, 
after  more  cycles,  panels  (c)  and  (d)  show  that  the  patches  have  expanded  in  scale.  To  quantify  this 
impression,  we  have  computed  one-dimensional  concentration  spectra  which  show  that  an  increasingly 
red  spectrum  develops. 

A seemingly innocuous ingredient of the Brownian-bug model is that deaths can occur anywhere, but births are always adjacent to a living bug. This asymmetry between birth and death is crucial for the spontaneous development of the voids and patches evident in figure 8: if one simulates birth by randomly placing the new bugs in the unit square then no patches form. This subtle point shows that making the births coincide with living bugs (surely a realistic feature of the model) introduces pair correlations. From another perspective, one can view the voids in figure 8 as the result of random extinctions. The step length of the random walk in figure 8 is such that diffusion is not strong enough to fill in the voids created by extinction.

Figure 8: (a) The initial condition is $N = 20{,}000$ randomly located bugs in the unit square. Panels (b), (c) and (d) then show the development of patches after 10, 100 and 1000 cycles of random displacement followed by random birth/death. As the panel titles indicate, there are random fluctuations in the total size, $N$, of the population: $N = 19{,}692$ after 10 cycles, $N = 17{,}728$ after 100 cycles and $N = 21{,}814$ after 1000 cycles. The RMS step length of the underlying random walk is $\langle \delta x_k^2 \rangle^{1/2} = 0.005$.

The ensemble average of the Brownian bug process is described by

$$\langle c \rangle_t = D \nabla^2 \langle c \rangle + (\lambda - \mu) \langle c \rangle, \tag{34}$$

where $\lambda$ is the birthrate and $\mu$ the deathrate. However, if the coin toss is fair then births and deaths are equiprobable and consequently $\lambda = \mu$. In this case the solution of (34) which satisfies the initial condition is

$$\lambda = \mu \quad \Rightarrow \quad \langle c \rangle = 1/N. \tag{35}$$

The uniform density above is the correct answer for the ensemble average concentration: the locations of the voids and patches in figure 8 are accidentally determined by the MATLAB random number generator. If we ensemble average many such patterns then the patches and voids must disappear because the process is spatially homogeneous.

On the other hand, the spatial average of a single realization, such as that in figure 8, will still show concentration patches (see footnote 2). Thus, in this Brownian bug example, $\bar{c} \neq \langle c \rangle$. Indeed, the patches
are  surely  an  important  feature  of  the  “real”  answer.  The  correct  but  useless  result  in  (35)  exposes 
a  failure  of  ensemble  averaging.  What  do  we  make  of  this  example?  Are  biological  problems,  with 
reproduction  and  death,  so  fundamentally  different  from  the  advection-diffusion  of  chemical  tracers? 
I  am  not  prepared  to  answer  that  question  in  these  lectures  and  I  leave  further  development  of  this 
example  to  the  students. 


5  Variance  budgets 

In  this  section  we  return  to  basics  and  present  an  alternative  view  of  eddy-diffusivity.  The  following 
arguments emphasize the importance of the concentration variance equation.

5.1  The  Reynolds’  decomposition 

Our  point  of  departure  is  the  advection-diffusion  equation 

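$$c_t + \boldsymbol{u} \cdot \nabla c = \kappa \nabla^2 c + s, \tag{36}$$

where $\kappa$ is the molecular diffusivity of $c$ and $\boldsymbol{u}$ is an incompressible ($\nabla \cdot \boldsymbol{u} = 0$) velocity field. In (36) we have included a source term, $s(\boldsymbol{x},t)$, which forces the system.

The velocity $\boldsymbol{u}$ in (36) is a single realization selected from an ensemble of velocity fields. Then we can introduce the “Reynolds’ decomposition”:

$$c = \langle c \rangle + c', \tag{37}$$

where $\langle\,\rangle$ is the ensemble average and $c'$ is the fluctuation from $\langle c \rangle$ which arises in a single realization. Taking the ensemble average of (36) gives

$$\langle c \rangle_t + \langle \boldsymbol{u} \rangle \cdot \nabla \langle c \rangle + \nabla \cdot \langle \boldsymbol{u}' c' \rangle = \kappa \nabla^2 \langle c \rangle + s. \tag{38}$$

(The source $s$ is taken to be deterministic, $\langle s \rangle = s$.)

Subtracting the ensemble average in (38) from (36) gives the fluctuation equation

$$c'_t + \langle \boldsymbol{u} \rangle \cdot \nabla c' + \nabla \cdot [\boldsymbol{u}' c' - \langle \boldsymbol{u}' c' \rangle] - \kappa \nabla^2 c' = -\boldsymbol{u}' \cdot \nabla \langle c \rangle. \tag{39}$$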

Equation  (39)  has  been  organized  by  taking  the  source  term  to  the  right  hand  side.  Thus  we  see  that 
advective  distortion  of  the  mean  gradient,  V(c),  generates  the  fluctuation  c'. 

Footnote 2: If the width of the kernel, $K$ in (30), is larger than the dimension of the patches then filtering will remove the patches. However, since the patches expand in scale, eventually they will become so large that they survive the blurring power of the filter.

5.2 Consequences of linearity

If $c' = 0$ at $t = 0$ then, because (39) is linear, $c'$ and $\nabla \langle c \rangle$ will be linearly related. It follows that the eddy flux $\langle \boldsymbol{u}' c' \rangle$ will also be linearly related to the mean gradient $\nabla \langle c \rangle$. These simple observations,
in  alliance  with  the  scale  separation  assumption,  can  be  used  to  extract  a  surprising  amount  of 
information  [2] . 

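Because of the scale separation, it is plausible that this linear relation between eddy flux and mean gradient can be developed in a series of the form

$$\langle u_i' c' \rangle = -\mathcal{D}^{(1)}_{ij} * \langle c \rangle_{,j} - \mathcal{D}^{(2)}_{ijk} * \langle c \rangle_{,jk} + \cdots \tag{40}$$

The comma subscripts denote partial derivatives, $a_{,j} \equiv \partial a / \partial x_j$. We are also using the Einstein summation convention, in which repeated indices are summed. The $*$ in (40) indicates that the product also involves convolutions in time, such as

$$\mathcal{D}^{(1)}_{ij} * \langle c \rangle_{,j} = \int_0^t \mathcal{D}^{(1)}_{ij}(t')\, \langle c \rangle_{,j}(t - t')\, \mathrm{d}t'. \tag{41}$$

If the mean field is varying slowly over an eddy decorrelation time then the convolution above approximates to

$$\langle u_i' c' \rangle \approx -\mathcal{D}^{(1)}_{ij} * \langle c \rangle_{,j} \approx -\int_0^\infty \mathcal{D}^{(1)}_{ij}(t')\, \mathrm{d}t'\; \langle c \rangle_{,j}(t). \tag{42}$$

In the simplest cases (see footnote 3)

$$\int_0^\infty \mathcal{D}^{(1)}_{ij}(t')\, \mathrm{d}t' = D_e \delta_{ij}, \tag{43}$$

where $D_e$ is the eddy diffusivity. Using (43) the flux-gradient relation is

$$\langle \boldsymbol{u}' c' \rangle - \kappa \nabla \langle c \rangle = -D \nabla \langle c \rangle, \qquad D \equiv D_e + \kappa, \tag{44}$$

and the evolution of the average concentration is determined by

$$\langle c \rangle_t \approx D \nabla^2 \langle c \rangle + s. \tag{45}$$

This is a general version of the specific diffusion equation derived in Section 3.3 for the renovating wave model.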


5.3 The G·x-trick

The tensors $\mathcal{D}^{(n)}(t)$ are determined by the linear operator on the left-hand side of (39). Thus, these tensors depend on (i) the statistical properties of $\boldsymbol{u}'$; (ii) the mean advection $\langle \boldsymbol{u} \rangle$; (iii) the molecular diffusion $\kappa$. The essential point is that these tensors do not depend on $\langle c \rangle$. At least for the first term in the series, we can exemplify this by noting that there is a special solution of (36) in which $\langle \boldsymbol{u} \rangle = s = 0$ and the concentration has the form

$$c = \boldsymbol{G} \cdot \boldsymbol{x} + c'. \tag{46}$$

Footnote 3: “Simple” means that the velocity ensemble is isotropic, homogeneous and reflexionally invariant. The last requirement means that the mirror image of a particular realization of $\boldsymbol{u}'$ is just as probable as $\boldsymbol{u}'$. If the ensemble is reflexionally invariant then $\mathcal{D}^{(1)}_{ij}$ is a symmetric tensor. This subtle point will be illustrated later in this lecture series.

In (46) the mean concentration is simply $\langle c \rangle = \boldsymbol{G} \cdot \boldsymbol{x}$ and the fluctuation $c'$ is determined from a reduced version of (39):

$$c'_t + \boldsymbol{u}' \cdot \nabla c' - \kappa \nabla^2 c' = -\boldsymbol{G} \cdot \boldsymbol{u}'. \tag{47}$$

As emphasized above, the advection of the mean gradient appears as a source term for $c'$ on the right-hand side of (47). Because (47) is linear, and $\boldsymbol{G}$ is constant, the solution $c'$ will be proportional to the large-scale gradient $\boldsymbol{G}$ and otherwise independent of $\boldsymbol{G}$.

This G·x-trick enforces the platonic ideal of scale separation between the eddies and the mean field. If the concept of an eddy diffusivity is to have any validity, then it must work in the simplified context of (47). In fact, the G·x-trick is used in doubly-periodic turbulence simulations to calculate eddy diffusivities. In that context, $\boldsymbol{u}' = (u, v)$ and $c'$ are efficiently represented by Fourier series. Then (47) is solved using a spectral code and the eddy flux is estimated by computing the integral

$$\langle \boldsymbol{u}' c' \rangle = A^{-1} \iint \boldsymbol{u}' c' \, \mathrm{d}x\, \mathrm{d}y, \tag{48}$$

over the computational domain. (In (48) $A$ is the total area of the domain so $\langle 1 \rangle = 1$.) Notice that in (48) the ensemble average is identified with an integral over the domain. Later in these lectures we will use this same procedure to analytically calculate the eddy diffusivities of some spatially periodic velocity fields.

5.4  The  concentration  variance  equation 

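An equation for the concentration variance,

$$Z \equiv \tfrac{1}{2} \langle c'^2 \rangle, \tag{49}$$

is obtained by multiplying (39) by $c'$ and ensemble averaging. The result is

$$Z_t + \langle \boldsymbol{u} \rangle \cdot \nabla Z + \nabla \cdot \langle \tfrac{1}{2} \boldsymbol{u}' c'^2 \rangle - \kappa \nabla^2 Z = -\kappa \langle \nabla c' \cdot \nabla c' \rangle - \langle \boldsymbol{u}' c' \rangle \cdot \nabla \langle c \rangle. \tag{50}$$

The terms on the left-hand side of (50) can be interpreted as fluxes of $Z$. The two terms on the right-hand side of (50) are respectively dissipation of variance by molecular diffusion, $\kappa$, and a source of variance due to advective distortion of the mean gradient.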
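The step from the fluctuation equation (39) to the variance equation (50) can be made explicit. The sketch below lists, term by term, what multiplying (39) by $c'$ produces before averaging. It assumes only that $\nabla \cdot \boldsymbol{u}' = 0$ (which follows because both $\boldsymbol{u}$ and its average are incompressible) and that $\langle c' \rangle = 0$.

```latex
\begin{align*}
c'\,c'_t &= \left(\tfrac12 c'^2\right)_t , \\
c'\,\langle \boldsymbol{u} \rangle \cdot \nabla c'
  &= \langle \boldsymbol{u} \rangle \cdot \nabla \left(\tfrac12 c'^2\right) , \\
c'\,\nabla \cdot (\boldsymbol{u}' c')
  &= \nabla \cdot \left(\tfrac12 \boldsymbol{u}' c'^2\right)
  \qquad (\text{using } \nabla \cdot \boldsymbol{u}' = 0) , \\
\left\langle c'\,\nabla \cdot \langle \boldsymbol{u}' c' \rangle \right\rangle
  &= \langle c' \rangle\, \nabla \cdot \langle \boldsymbol{u}' c' \rangle = 0 , \\
-\kappa\, c' \nabla^2 c'
  &= -\kappa \nabla^2 \left(\tfrac12 c'^2\right) + \kappa\, \nabla c' \cdot \nabla c' , \\
-c'\, \boldsymbol{u}' \cdot \nabla \langle c \rangle
  &\;\longrightarrow\; -\langle \boldsymbol{u}' c' \rangle \cdot \nabla \langle c \rangle
  \quad \text{after averaging.}
\end{align*}
```

Averaging each line and using the definition $Z = \tfrac12 \langle c'^2 \rangle$ reproduces the terms of the variance equation in the order in which they appear there.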

5.5  Heuristic  closure  arguments 

In (50) there are three terms which we would like to relate to the mean quantities $\langle c \rangle$ and $Z$. First, there is $-\langle \boldsymbol{u}' c' \rangle \cdot \nabla \langle c \rangle = D_e \nabla \langle c \rangle \cdot \nabla \langle c \rangle$. The remaining two terms are $\langle \boldsymbol{u}' c'^2 / 2 \rangle$ and $\kappa \langle \nabla c' \cdot \nabla c' \rangle$.

The correlation $\langle \boldsymbol{u}' c'^2/2 \rangle$ in (50) is an eddy flux of $c'^2$, just as $\langle \boldsymbol{u}' c' \rangle$ is an eddy flux of $c'$. Thus, by analogy with (44), we can argue that

$$\tfrac{1}{2} \langle \boldsymbol{u}' c'^2 \rangle = -D_e \nabla Z. \tag{51}$$

This heuristic argument is discussed further in appendix B.

The final term in (50) is the dissipation of variance by molecular diffusivity, $\kappa \langle \nabla c' \cdot \nabla c' \rangle$. The simplest closure assumption we can make about this term is that

$$\kappa \langle \nabla c' \cdot \nabla c' \rangle \approx \beta Z, \tag{52}$$

where $\beta$ has the dimensions of inverse time. The closure above relies on dimensional analysis and the linearity of (36). However, in anticipation of a later discussion of the Batchelor spectrum, we now make some heuristic arguments in support of (52) which suggest that $\beta$ is independent of the molecular diffusivity.

Suppose that the mean field $\langle c \rangle$ has a length scale $L$ and that the velocity field $\boldsymbol{u}'$ has a length scale $L_u$ (in the RW example $L_u = k^{-1}$). The scale separation assumption is that

$$L \gg L_u. \tag{53}$$

The inequality in (53) is exemplified in the idealized case of (46), in which $L$ is infinite. It follows that advective distortion of $\nabla \langle c \rangle$ generates $c'$ first on the scale $L_u$. Then, following our arguments in lecture 1, the scale of $c'$ will be exponentially reduced, like $\exp(-\gamma t)$, where $\gamma$ is roughly proportional to the RMS strain of $\boldsymbol{u}'$. This exponential contraction continues until the cascade is halted by molecular diffusion at the scale

$$\ell \equiv \sqrt{\kappa / \gamma}. \tag{54}$$

Using arguments from lecture 1, we can estimate that the time taken for this arrest at $\ell$ is

$$t_\ell \approx \gamma^{-1} \ln(L_u / \ell). \tag{55}$$

Then the smallest length scale in the $c'$-field is $\ell$ and, plausibly, the gradient is $\nabla c' \sim c'_{\mathrm{RMS}} / \ell$ where $c'_{\mathrm{RMS}} = \sqrt{2Z}$. We now have a simple estimate $\kappa \langle \nabla c' \cdot \nabla c' \rangle \sim \gamma Z$. This rough argument leads to the closure in (52), with $\beta \propto \gamma$, and the caveat that $t \gg t_\ell$.

We can summarize the arguments above by rewriting the variance equation (50) as

$$Z_t + \langle \boldsymbol{u} \rangle \cdot \nabla Z - D \nabla^2 Z = D_e \nabla \langle c \rangle \cdot \nabla \langle c \rangle - \beta Z, \qquad (\text{if } t \gg t_\ell). \tag{56}$$

The  most  dubious  approximation  is  probably  (52).  To  conclude  this  discussion  we  will  interpret  the 
variance  equation  in  two  specific  examples. 

5.6  Example  1:  the  dispersing  front 

First consider the dispersing front in figure 5. In this example $s = \kappa = \langle \boldsymbol{u} \rangle = 0$ and we already know from (28) that

$$Z = \tfrac{1}{2} \left[ 1 - \mathrm{erf}^2(\eta) \right]. \tag{57}$$

On the other hand, since $\kappa = 0$, it follows that $D = D_e$ and $\beta = 0$. With these simplifications the variance equation (50) reduces to

$$Z_t - D Z_{xx} = D \nabla \langle c \rangle \cdot \nabla \langle c \rangle, \tag{58}$$

where $\langle c \rangle$ is the erf-solution in (25). As a consistency check, one can show that (57) is the solution of the variance equation in (58).

This  example  shows  that  the  destruction  of  variance  by  molecular  diffusivity  is  not  required  in 
order  to  prevent  an  accumulation  of  variance:  the  source  on  the  right-hand  side  of  (58)  is  balanced 
by  eddy  diffusion. 
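The consistency check mentioned above can also be done numerically. The sketch below evaluates the residual of (58) with centred finite differences, taking $\langle c \rangle = \mathrm{erf}(\eta)$ and $Z$ from (57). The similarity variable $\eta = x/(2\sqrt{Dt})$ is an assumption here, since (25) is not reproduced in this section, and the function names are hypothetical.

```python
import math


def eta(x, t, D):
    """Assumed similarity variable of the erf solution."""
    return x / (2.0 * math.sqrt(D * t))


def mean_c(x, t, D):
    """Mean concentration of the dispersing front."""
    return math.erf(eta(x, t, D))


def variance_Z(x, t, D):
    """Z = (1/2) [1 - erf(eta)^2], as in (57)."""
    return 0.5 * (1.0 - math.erf(eta(x, t, D)) ** 2)


def residual(x, t, D, h=1e-3):
    """Z_t - D Z_xx - D <c>_x^2, evaluated with centred finite differences."""
    Z_t = (variance_Z(x, t + h, D) - variance_Z(x, t - h, D)) / (2.0 * h)
    Z_xx = (variance_Z(x + h, t, D) - 2.0 * variance_Z(x, t, D)
            + variance_Z(x - h, t, D)) / h ** 2
    c_x = (mean_c(x + h, t, D) - mean_c(x - h, t, D)) / (2.0 * h)
    return Z_t - D * Z_xx - D * c_x ** 2


D = 0.3
worst = max(abs(residual(x, t, D))
            for x in (-2.0, -0.5, 0.0, 0.7, 1.5)
            for t in (0.5, 1.0, 2.0))
print("largest residual of (58):", worst)
```

The residual is at the level of the finite-difference truncation error, while the individual terms are of order one near the front, so the balance in (58) is not trivially satisfied.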

5.7  Example  2:  a  large-scale  source 

In this second example the tracer is injected by a source $s = \cos qx$ in (36). We also take $\langle \boldsymbol{u} \rangle = 0$ so that the mean concentration field is obtained by solving

$$\langle c \rangle_t - D \nabla^2 \langle c \rangle = \cos qx, \quad \Rightarrow \quad \langle c \rangle = \frac{1 - e^{-Dq^2 t}}{Dq^2} \cos qx. \tag{59}$$

(To apply the diffusion equation the scale of the source, $q^{-1}$, must be much larger than the scale of the velocity field.) A steady mean concentration pattern is established when $Dq^2 t \gg 1$.

Figure 9: A simulation of the source problem, with $s = \cos(x/6)$, using the RW model to generate $\boldsymbol{u}$ in (36). There is no molecular diffusivity ($\kappa = 0$). The left-hand panel shows the whole domain (the length of the side is $12\pi$) while the right-hand panel shows a smaller subdomain (the length of the side is $2\pi$). The concentration fields were generated by 10 pulses of the renovating wave using $\tau_* = 3$ (that is, $t = 30$).

The concentration variance is determined by solving the variance equation (56)

$$Z_t - D \nabla^2 Z = \frac{\left[1 - e^{-Dq^2 t}\right]^2}{2Dq^2} (1 - \cos 2qx) - \kappa \langle \nabla c' \cdot \nabla c' \rangle. \tag{60}$$

In (60), the solution in (59) has been used to evaluate the source term on the right-hand side and we have left the diffusive sink in its exact form.

It is clear from (60) that the molecular diffusion, $\kappa$, plays an important role. If $\kappa = 0$ then the long-time solution of (60) has a component which eventually grows linearly with time:

$$\kappa = 0, \quad \Rightarrow \quad Z \propto t / 2Dq^2. \tag{61}$$

Thus, without molecular diffusion, there is “runaway variance”. Ultimately, in a single realization, the mean field in (59) will be buried under enormous fluctuations.

To give an intuitive derivation of (61) we argue that with $\kappa = 0$ the concentration on each fluid element is determined by solving the Lagrangian equation

$$\frac{Dc}{Dt} = \cos qx(t), \tag{62}$$

where $x(t)$ is the randomly changing $x$-position of the particle. Thus, the concentration on each particle is undergoing a random walk along the $c$-axis, which is induced by the random motion of the particle through the $\cos qx$ source function. The decorrelation time of this walk is the time it takes a particle to diffuse through a distance of order $q^{-1}$, which is $1/Dq^2$. Thus, in a time $t$, there are roughly $N(t) \sim Dq^2 t$ independent steps along the $c$-axis. But because the source acts coherently for a time $1/Dq^2$ with a strength of order unity, the step length of this random walk is roughly $\Delta c \sim 1/Dq^2$. Thus, the mean square displacement of $c$ is

$$\langle c'^2 \rangle \sim (\Delta c)^2 N(t) \sim \frac{t}{Dq^2}, \tag{63}$$

which is the final result in (61).

Figure 10: This figure compares analytic results with a numerical solution of (62), taking $q = 1/10$, and using the RW model to generate $\boldsymbol{u}$. The persistence parameter is $\tau_* = 2$ and the results are at $t = 400$ (that is, 200 renovation cycles). The concentration $c(\boldsymbol{x}, 400)$ is calculated on a $400 \times 400$ grid using the method in appendix C. In the top panel there are three curves: the concentration as a function of $0 < x < 20\pi$ along the line $y = 0$ (the jagged dotted curve); the $y$-averaged concentration defined in (64); the analytic result in (59) (the smooth sinusoid). The bottom panel compares the $c_{\mathrm{RMS}} = \sqrt{2Z}$ obtained by solving (60) analytically with $c_{\mathrm{RMS}}$ estimated using (64).
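The random-walk argument for runaway variance can be tested with a few lines of code. In the sketch below the RW advection is replaced by a Brownian path with diffusivity $D$, which is an assumption made to keep the example short and is not the simulation used for the figures. Each particle integrates the Lagrangian equation $Dc/Dt = \cos qx(t)$ with a forward-Euler step. The function name and parameter values are hypothetical.

```python
import math
import random


def lagrangian_mean_square_c(n_particles=1000, t_end=40.0, dt=0.1, D=1.0, q=1.0, seed=2):
    """Mean of c^2 over particles obeying dc/dt = cos(q x(t)), with x(t) a Brownian path."""
    rng = random.Random(seed)
    n_steps = int(round(t_end / dt))
    step_rms = math.sqrt(2.0 * D * dt)
    total = 0.0
    for _ in range(n_particles):
        x = rng.uniform(0.0, 2.0 * math.pi / q)  # random start within one period of the source
        c = 0.0
        for _ in range(n_steps):
            c += math.cos(q * x) * dt       # forward-Euler step of the Lagrangian equation
            x += rng.gauss(0.0, step_rms)   # diffusive displacement with <dx^2> = 2 D dt
        total += c * c
    return total / n_particles


D, q, t_end = 1.0, 1.0, 40.0
mean_square_c = lagrangian_mean_square_c(t_end=t_end, D=D, q=q)
print("mean square c:", mean_square_c, "estimate t/(D q^2):", t_end / (D * q * q))
```

The mean square concentration grows linearly in time at a rate close to $1/Dq^2$, in line with the scaling estimate; with no molecular diffusion nothing halts this growth.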

It is interesting to compare the analytic results in (59) and (60) with a numerical solution of (62). Thus we must compute the spatial averages

$$\bar{c}(x,t) = \frac{1}{L} \int_0^L c(x,y,t)\, \mathrm{d}y, \qquad c_{\mathrm{RMS}}^2(x,t) = \frac{1}{L} \int_0^L \left[ c(x,y,t) - \bar{c} \right]^2 \mathrm{d}y, \tag{64}$$

using the numerical solution (here $L$ denotes the length of the domain in $y$), and compare these with the analytic results for $\langle c \rangle$ and $Z = c_{\mathrm{RMS}}^2/2$. The best way to make this comparison is to obtain $c(x,y,t)$ on a regular grid in the $(x,y)$-plane. As a bonus, one can then also use contouring routines to make pretty pictures of the concentration field (see figure 9).

The concentration field is calculated on a regular grid using the procedure described in Appendix C (essentially the method of characteristics). Figure 10 shows good agreement between this simulation and analytic results. Notice that in figure 10 the variance $Z$ peaks where $\nabla \langle c \rangle$ is greatest. This illustrates that concentration fluctuations are produced by advective distortion of the mean gradient: where the mean gradient is large there is lots of variance. But $Z \neq 0$ even where $\nabla \langle c \rangle = 0$ (for example, at $x = 0$ and $x = 10\pi$ in figure 10). Thus, where the source term on the right-hand side of (60) vanishes, the diffusive term $D \nabla^2 Z$ supplies variance.

5.8  Cautionary  remarks 

In both examples above there is no molecular diffusion ($\kappa = 0$) and consequently there is no destruction of variance by the term $\kappa \langle \nabla c' \cdot \nabla c' \rangle$ in (50). As a project for a student, include molecular diffusion in the RW model (perhaps by pulsing diffusion in alternation with advection) and assess the efficacy of this process. In particular, can the closure in (52) be justified?

A Calculation of the RW Green’s function

In this appendix we present an alternative calculation of the RW ensemble averaged Green’s function, $g(r)$, in (17). The unaveraged Green’s function, $G(\boldsymbol{x}, \boldsymbol{x}_0, t)$, is the solution of (10). Because the process is spatially homogeneous it is harmless to take $\boldsymbol{x}_0 = 0$ so that

$$G(\boldsymbol{x}, 0, \tau_*) = \delta[x - \tau_* s \sin\varphi]\, \delta[y + \tau_* c \sin\varphi], \tag{65}$$

where $(s, c) \equiv (\sin\theta, \cos\theta)$. The ensemble average of (65) is computed by integration over $\varphi$ and $\theta$, as in (4). It is very pleasant that there are two integrals and two $\delta$-functions. Thus, we first do the $\varphi$-integral by noting that $\delta[x - \tau_* s \sin\varphi]$ is nonzero at the two values of $\varphi$ where $\sin\varphi = x / \tau_* s$, and at those positions

$$\frac{\mathrm{d}}{\mathrm{d}\varphi} [x - \tau_* s \sin\varphi] = \pm \sqrt{\tau_*^2 s^2 - x^2}. \tag{66}$$

Using the standard rule for changing variables in a $\delta$-function, we find that the average of (65) over $\varphi$ alone is

$$\langle G \rangle = \frac{1}{\pi} \frac{\delta(y + x \cot\theta)}{\sqrt{\tau_*^2 \sin^2\theta - x^2}}. \tag{67}$$

The second integral over $\theta$ is performed by noting that $\delta(y + x \cot\theta)$ is nonzero at the two values of $\theta$ where $\cot\theta = -y/x$, and at those positions

$$\sin^2\theta = \frac{x^2}{x^2 + y^2}, \qquad \frac{\mathrm{d}}{\mathrm{d}\theta} [y + x \cot\theta] = -\frac{x^2 + y^2}{x}. \tag{68}$$

After changing variables in the $\delta$-function we recover $g(r)$ in (17).

B Eddy diffusion of variance

Ignoring small molecular diffusion ($\kappa = 0$), if $c$ satisfies the advection equation then any function of $c$ satisfies the same equation. That is to say

$$\frac{Df}{Dt} = 0, \tag{69}$$

where $f(c) = c^2$, or $\exp(c)$, etcetera. Taking an ensemble average, and making the same arguments for $f(c)$ as for $c$, we have that

$$\langle f \rangle_t = D \nabla^2 \langle f \rangle. \tag{70}$$

In the particular case $f = c^2/2$, $\langle f \rangle = \langle c \rangle^2/2 + Z$ and (70) reduces to

$$Z_t = D \nabla^2 Z + D \nabla \langle c \rangle \cdot \nabla \langle c \rangle. \tag{71}$$

Matching the terms in (71) with those in (50) we conclude that $\langle \boldsymbol{u}' c'^2/2 \rangle = -D \nabla Z$.
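The reduction of (70) to (71) takes two lines. The sketch below assumes, as in this appendix, that the mean concentration itself obeys the unforced diffusion equation $\langle c \rangle_t = D \nabla^2 \langle c \rangle$, and substitutes $\langle f \rangle = \langle c \rangle^2/2 + Z$ into both sides of (70).

```latex
\begin{align*}
\langle f \rangle_t
  &= \langle c \rangle \langle c \rangle_t + Z_t
   = D \langle c \rangle \nabla^2 \langle c \rangle + Z_t , \\
D \nabla^2 \langle f \rangle
  &= D \langle c \rangle \nabla^2 \langle c \rangle
   + D \nabla \langle c \rangle \cdot \nabla \langle c \rangle
   + D \nabla^2 Z .
\end{align*}
```

Equating the two right-hand sides, the term $D \langle c \rangle \nabla^2 \langle c \rangle$ cancels and what remains is (71).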

C  Numerical  simulation  of  the  RW  process 

Drawing figures 9 and 10 requires that we obtain the solution of (62) on a regular grid in the $(x, y)$-plane. This is an opportunity to use the method of characteristics and learn some MATLAB programming techniques.

Equation (3) shows how the movement of a particle in the RW velocity field is equivalent to a random map which determines the position at $(n+1)\tau_*$ in terms of the previous position at $n\tau_*$. If this particle carries a concentration, $c(\boldsymbol{x},t)$, which changes because of the $\cos qx$ source in (62), then the concentration changes can also be calculated and expressed as a map in discrete time.

Thus, suppose that the concentration on a particle at time $t = n\tau_*$ is $c_n$. Then the change in concentration during $n\tau_* < t < (n+1)\tau_*$ is obtained by integrating

$$\frac{Dc}{Dt} = \cos[q x_n + q u_n (t - n\tau_*)], \tag{72}$$

where the constant $x$-velocity of the particle is $u_n = s_n \sin(c_n x_n + s_n y_n + \varphi_n)$, with $(s_n, c_n) \equiv (\sin\theta_n, \cos\theta_n)$. The integral of (72) can be written as

$$c_{n+1} = c_n + \frac{\sin(q x_{n+1}) - \sin(q x_n)}{q u_n}. \tag{73}$$

With equations (73) and (3) we can advance forward in time and so determine the concentration on a particle at $t = n\tau_*$.
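The concentration map for a single pulse can be isolated in a small function. The sketch below is an illustration rather than part of the original program: it returns the change in concentration over one pulse, and it adds a guard for the case $u_n \to 0$, where the closed-form expression divides by zero and the increment tends to $\tau_* \cos(q x_n)$. The function name and the tolerance are hypothetical.

```python
import math


def concentration_increment(x_n, u_n, tau, q):
    """Integral of cos(q * (x_n + u_n * t)) over 0 < t < tau: the change in c over one pulse."""
    x_next = x_n + u_n * tau
    if abs(q * u_n * tau) < 1e-8:
        # Limit u_n -> 0: the particle barely moves, so the source is constant in time.
        return tau * math.cos(q * x_n)
    return (math.sin(q * x_next) - math.sin(q * x_n)) / (q * u_n)


def midpoint_quadrature(x_n, u_n, tau, q, m=20000):
    """Brute-force check of the same integral with the midpoint rule."""
    h = tau / m
    return h * sum(math.cos(q * (x_n + u_n * (k + 0.5) * h)) for k in range(m))


print(concentration_increment(0.3, 0.7, 3.0, 1.0 / 6.0),
      midpoint_quadrature(0.3, 0.7, 3.0, 1.0 / 6.0))
```

The guard matters in practice because the $x$-velocity vanishes wherever the wave phase is a multiple of $\pi$ or the wave direction makes $s_n = 0$; an unguarded division would then return an undefined value at those grid points.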

However, we need to determine the concentration at $t = n\tau_*$ at a specified grid point $\boldsymbol{x}$. The trick
is illustrated in the MATLAB program below.


%% Solution of
%%
%%   Dc/Dt = cos(q x);
%%
%% cos(q x) is a large-scale source and u is the RW velocity.
%% The RW streamfunction is psi = cos[cos(theta) x + sin(theta) y + phi]
clc
N=400;          %% Use an N*N grid in the plotting window
q=1/6;          %% The wavenumber of the cos q x source
LL=2*pi/q;      %% LL is the domain size
npulse=10       %% The number of renovation cycles
tau=3;          %% The pulse duration of the wave

%% Lwin is the side of the square plotting window.
%% Set Lwin=LL to see the big picture. To see small scale details,
%% try Lwin = 2*pi. We draw two subplots with different Lwin's
nloop=0;
for Lwin=[LL 2*pi]
  nloop=nloop+1
  x=linspace(0,Lwin,N);   %% x is the coordinate in the plotting window.
  h=x(2);                 %% The grid spacing in the plotting window
  for j=1:N
    jj=[((j-1)*N+1):(j*N)];
    pos(jj,1)=zeros(N,1)+(j-1)*h;
    pos(jj,2)=x';
  end
  conc=zeros(N*N,1);
  %% The positions of the N^2 particles are stored in pos with
  %% N^2 rows and 2 columns. Each vertical segment of
  %% length N in pos contains particles with the same initial x-position.
  %% The column vector conc contains the concentration on the
  %% N*N particles in pos. Initially, conc=0 at the N*N
  %% grid points. Then we integrate
  %% backwards in time to find the concentration change.
  for k=1:1:npulse
    theta=rand*2*pi;
    wavevec=[cos(theta),sin(theta)]';
    phase=rand*2*pi;
    vel=sin(pos*wavevec+phase)*[wavevec(2),-wavevec(1)];
    conc=conc-sin(q*pos(:,1))./(q*vel(:,1));
    pos=pos+tau*vel;
    conc=conc+sin(q*pos(:,1))./(q*vel(:,1));
  end
  %% Emerging from this loop, we have the new positions
  %% and the new concentration
  conc=reshape(conc,N,N);   %% conc is reshaped into an N*N matrix
  hh=subplot(1,2,nloop)
  colormap('gray')
  imagesc(x,x,conc)
  axis equal
  xlabel('x')
  ylabel('y')
  axis([0 Lwin 0 Lwin])
  set(hh,'ydir','norm')
end

Lecture 3: Stretching


1  Line  stretching 

In the previous lecture we emphasized that the destruction of tracer variance by molecular diffusivity relies on the increase of $\nabla c$ by stirring. Thus the term $\kappa \langle \nabla c' \cdot \nabla c' \rangle$ in the variance budget eventually becomes important, even though the molecular diffusivity $\kappa$ is very small. One goal of this lecture is to understand in more detail how tracer gradients in a moving fluid are amplified by simple velocity fields. We will assume that $\kappa = 0$ so that there is stirring without mixing. This is a good approximation provided that the smallest scale in the tracer field is much greater than the length $\ell = \sqrt{\kappa/\alpha}$ that we identified in lecture 1.

Gradient  amplification  is  closely  related  to  the  stretching  of  material  lines,  a  subject  that  was 
opened  by  Batchelor  in  1952.  A  material  line  is  a  curve  that  consists  always  of  the  same  fluid 
particles.  Batchelor’s  main  conclusion  is  that  there  is  a  timescale  governing  the  ultimate  growth  of  an 
infinitesimal  line  element,  but  no  length  scale  other  than  that  of  the  element  itself.  These  dimensional 
considerations  force  the  conclusion  that  the  element  grows  exponentially, 

$$\ell = \ell_0 e^{\gamma t}, \tag{1}$$

where $\ell$ is the length of the element, $\ell_0$ is its initial value, and $\gamma$ is a constant with dimensions of inverse time, related to the timescale that Batchelor had in mind.

Just as some close particle pairs separate exponentially, other pairs starting at distant points are brought close together. This might seem paradoxical until one recalls the folded tracer patterns evident in Welander’s 1955 experiments (see the final figures in lecture 1). If two closely approaching particles are carrying different values of $c$ then the gradient $\nabla c$ will be amplified. Thus, as a corollary of (1) we expect that $|\nabla c| \sim |\nabla c_0| \exp(\gamma t)$. It is through this exponential amplification of the concentration gradients that the small molecular diffusivity $\kappa$ is able eventually to destroy tracer variance.


1.1  Material  line  elements  and  tracer  gradients 

Using a geometric argument (see figure 1) we can give a proof-by-intimidation that a material line element, $\boldsymbol{\xi}(\boldsymbol{x},t)$, attached to a fluid element evolves according to

$$\frac{D\boldsymbol{\xi}}{Dt} = (\boldsymbol{\xi} \cdot \nabla) \boldsymbol{u}. \tag{2}$$

Here the “convective derivative” is $D/Dt = \partial_t + \boldsymbol{u} \cdot \nabla$. The field of line elements can be visualized as a collection of tiny straight arrows attached to each moving particle of fluid. Then (2) describes the evolution of this collection of arrows. Notice that (2) refers to an infinitesimal line element $\boldsymbol{\xi}$. If the length of a material line is comparable to the scale of $\boldsymbol{u}$ there is no longer a simple relation between the stretching of the material line and local properties of $\boldsymbol{u}$, such as $\nabla \boldsymbol{u}$.

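Figure 1: The line element $\boldsymbol{\xi}$ is short enough to remain straight and to experience a strain that is uniform over its length during the time $\delta t$. Proof by intimidation of (2): $\delta \boldsymbol{\xi} = [\boldsymbol{u}(\boldsymbol{x} + \boldsymbol{\xi}, t) - \boldsymbol{u}(\boldsymbol{x}, t)]\, \delta t$, and take $(\delta t, \boldsymbol{\xi}, \delta \boldsymbol{\xi}) \to 0$.

Taking the gradient of the tracer equation

$$\frac{Dc}{Dt} = 0 \tag{3}$$

gives

$$\frac{D \nabla c}{Dt} = -(\nabla \boldsymbol{u}) \cdot \nabla c, \tag{4}$$

or, in components, $D c_{,i}/Dt = -u_{j,i}\, c_{,j}$.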

Despite  the  difference  in  the  sign  of  the  right  hand  sides  of  (2)  and  (4)  there  is  a  close  connection 
between  the  solutions  of  the  two  equations. 

To emphasize the connection between $\nabla c$ and $\boldsymbol{\xi}$, we mention the conservation law

$$\frac{D}{Dt}(\boldsymbol{\xi}\cdot\nabla c) = 0. \tag{5}$$

(Meteorologists and oceanographers might recognize (5) as a relative of potential vorticity conservation.)
Later in this lecture (5) is used to deduce $\nabla c$ from $\boldsymbol{\xi}$.

The easy way to prove (5) is to consider a pair of particles separated by a small displacement $\boldsymbol{\xi}$.
If the concentration carried by the first particle is $c_1$, and that of the second particle is $c_2 = c_1 + dc$,
then $dc = \boldsymbol{\xi}\cdot\nabla c$. Thus (5) is equivalent to the “obvious” fact that $dc$ is conserved as the two particles
move.

The difficult way to prove (5) is to take the dot product of $\nabla c$ with (2) and add this to the dot
product of $\boldsymbol{\xi}$ with (4). Performing some nonobvious algebra, perhaps with Mathematica or Maple,
one  can  eventually  simplify  the  mess  to  (5).  Suffering  through  this  tedious  exercise  will  convince  the 
student  that  the  earlier,  easy  proof  is  worthy  of  serious  attention. 
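
For reference, the algebra of the difficult proof can be sketched in index notation. The component forms below assume that (4) is read as the gradient of $Dc/Dt = 0$; the index names are ours and summation over repeated indices is implied.

```latex
\begin{align*}
\frac{D\xi_i}{Dt} &= \xi_j\,\frac{\partial u_i}{\partial x_j},
\qquad
\frac{D}{Dt}\frac{\partial c}{\partial x_i}
   = -\frac{\partial u_j}{\partial x_i}\,\frac{\partial c}{\partial x_j},\\[4pt]
\frac{D}{Dt}\left(\xi_i\,\frac{\partial c}{\partial x_i}\right)
  &= \xi_j\,\frac{\partial u_i}{\partial x_j}\,\frac{\partial c}{\partial x_i}
   - \xi_i\,\frac{\partial u_j}{\partial x_i}\,\frac{\partial c}{\partial x_j}
   = 0 .
\end{align*}
```

The two terms on the last line cancel once the dummy indices $i$ and $j$ are exchanged in one of them.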

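1.2 Eulerian versus Lagrangian: the golden rule

Particle trajectories, $\boldsymbol{x} = \boldsymbol{x}(\boldsymbol{x}_0, t)$, are determined by solving the differential equations

$$\frac{D\boldsymbol{x}}{Dt} = \boldsymbol{u}(\boldsymbol{x},t), \qquad \boldsymbol{x}(0) = \boldsymbol{x}_0. \tag{6}$$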

The solution of the differential equation above defines the particle position, $\boldsymbol{x}$, as a function of the
two independent variables, $\boldsymbol{x}_0$ and $t$. Using this time-dependent mapping between $\boldsymbol{x}$ and $\boldsymbol{x}_0$, we can
take a problem posed in terms of $\boldsymbol{x}$ and $t$ (the Eulerian formulation) and change variables to obtain
an equivalent formulation in terms of $\boldsymbol{x}_0$ and $t$ (the Lagrangian formulation).

In the Eulerian view, the independent variables are $\boldsymbol{x} = (x,y,z)$ and $t$. The convective derivative,

$$\frac{D}{Dt} = \frac{\partial}{\partial t} + u\frac{\partial}{\partial x} + v\frac{\partial}{\partial y} + w\frac{\partial}{\partial z}, \tag{7}$$


is  a  differential  operator  involving  all  of  the  independent  variables. 

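In the Lagrangian view, the independent variables are $\boldsymbol{x}_0$ and $t'$, and $\boldsymbol{x}(\boldsymbol{x}_0,t')$ is a dependent
variable. As an accounting device, the time variable is decorated with a prime to emphasize that
a $t'$-derivative is taken with the independent variables $\boldsymbol{x}_0$ held fixed. To move between the Eulerian and
Lagrangian representations notice that

$$\frac{\partial t}{\partial t'} = 1, \qquad\text{and}\qquad \frac{\partial}{\partial t'}(x,y,z) = (u,v,w). \tag{8}$$

The second equation above is the definition of velocity, $\boldsymbol{u} = (u,v,w)$.
Using (8), the rule for converting partial derivatives is

$$\frac{\partial}{\partial t'} = \frac{\partial t}{\partial t'}\frac{\partial}{\partial t} + \frac{\partial x}{\partial t'}\frac{\partial}{\partial x} + \frac{\partial y}{\partial t'}\frac{\partial}{\partial y} + \frac{\partial z}{\partial t'}\frac{\partial}{\partial z} = \frac{D}{Dt}. \tag{9}$$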


Equation (9) is the golden rule that enables us to interpret expressions such as

$$\frac{D}{Dt}(\text{unknown}) = \text{RHS} \tag{10}$$

in  either  Eulerian  or  Lagrangian  terms.  Using  the  golden  rule  we  can  dispense  with  the  prime  that 
decorates  the  Lagrangian  time  variable. 

In  the  Eulerian  interpretation  we  must  express  the  RHS  in  (10)  as  a  function  of  x,  y,  z  and  t 
and  use  the  Eulerian  definition  of  the  convective  derivative  in  (7).  Then  (10)  is  a  partial  differential 
equation  for  the  unknown. 

In the Lagrangian interpretation $D/Dt$ is the same as a simple time derivative and we must express
the RHS of (10) as a function of $x_0$, $y_0$, $z_0$ and $t$. Then (10) is an ordinary differential equation for the
unknown.

As an illustration of the transformation between Eulerian and Lagrangian variables, consider the
steady, unidirectional velocity field $\boldsymbol{u} = [u(y), 0]$. The solution of (6) is

$$x = x_0 + u(y)t, \qquad y = y_0. \tag{11}$$

In this example it is a simple matter to express $(x,y)$ in terms of $(x_0,y_0)$ and vice versa.

The line-stretching equation, (2), has the same form as (10). For the same unidirectional velocity
field, using components, $\boldsymbol{\xi} = (\xi,\eta)$, we have in Lagrangian variables

$$\frac{D\xi}{Dt} = \eta\,u'(y), \qquad \frac{D\eta}{Dt} = 0. \tag{12}$$

(We have used the golden rule (9) above.) Equation (12) is an ordinary differential equation and the
solution is

$$\xi = \xi_0(x_0,y_0) + t\,\eta_0(x_0,y_0)\,u'(y_0), \qquad \eta = \eta_0(x_0,y_0). \tag{13}$$

Using (11), we can write (13) in terms of Eulerian variables as

$$\xi = \xi_0[x - u(y)t, y] + t\,\eta_0[x - u(y)t, y]\,u'(y), \qquad \eta = \eta_0[x - u(y)t, y]. \tag{14}$$

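We can alternatively view (12) in terms of Eulerian variables and in this case we are confronted with
the partial differential equations

$$\frac{\partial\xi}{\partial t} + u(y)\frac{\partial\xi}{\partial x} = \eta\,u'(y), \qquad \frac{\partial\eta}{\partial t} + u(y)\frac{\partial\eta}{\partial x} = 0. \tag{15}$$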
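
A numerical spot check of the Eulerian formulas can be sketched as follows. The shear profile `u` and the initial fields `xi0` and `eta0` are hypothetical choices made only for illustration; the script evaluates $\xi = \xi_0[x-u(y)t,y] + t\,\eta_0[x-u(y)t,y]\,u'(y)$ and $\eta = \eta_0[x-u(y)t,y]$ and measures, by centered differences, how well they satisfy $\xi_t + u\,\xi_x = \eta\,u'$ and $\eta_t + u\,\eta_x = 0$.

```python
import math

def u(y):
    """Hypothetical shear profile u(y)."""
    return math.sin(y)

def du(y):
    """Derivative u'(y) of the profile above."""
    return math.cos(y)

def xi0(x, y):
    """Hypothetical initial field xi_0(x, y)."""
    return math.cos(x) * math.sin(2.0 * y)

def eta0(x, y):
    """Hypothetical initial field eta_0(x, y)."""
    return math.sin(x + y)

def xi(x, y, t):
    """First component of the Eulerian solution."""
    x_start = x - u(y) * t
    return xi0(x_start, y) + t * eta0(x_start, y) * du(y)

def eta(x, y, t):
    """Second component of the Eulerian solution."""
    return eta0(x - u(y) * t, y)

def residuals(x, y, t, h=1e-5):
    """Centered-difference residuals of the two Eulerian equations."""
    xi_t = (xi(x, y, t + h) - xi(x, y, t - h)) / (2.0 * h)
    xi_x = (xi(x + h, y, t) - xi(x - h, y, t)) / (2.0 * h)
    eta_t = (eta(x, y, t + h) - eta(x, y, t - h)) / (2.0 * h)
    eta_x = (eta(x + h, y, t) - eta(x - h, y, t)) / (2.0 * h)
    r1 = xi_t + u(y) * xi_x - eta(x, y, t) * du(y)
    r2 = eta_t + u(y) * eta_x
    return r1, r2

r1, r2 = residuals(0.3, 1.1, 2.0)
print(abs(r1) < 1e-6, abs(r2) < 1e-6)
```

Both residuals should vanish up to the truncation and rounding error of the finite differences.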

It is easy to check by substitution that (14) is the solution of (15).

1.3 Motion is equivalent to mapping


We obtained (2) using the geometric argument in figure 1. Now we admire some different scenery by
taking an algebraic path to (2). Our itinerary emphasizes that the solutions of (6) define a mapping
of the space $\boldsymbol{x}_0$ of initial coordinates onto the space $\boldsymbol{x}$, and hence the title of this section.

Using indicial notation (summation implied over repeated indices), it follows from the chain rule
that

$$dx_i = \frac{\partial x_i}{\partial x_{0j}}\,dx_{0j}. \tag{16}$$


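Taking the time derivative of (16), and keeping in mind that $x_{0j}$ is independent of $t$, gives

$$\frac{D}{Dt}dx_i = \frac{\partial u_i}{\partial x_{0j}}\,dx_{0j} = \frac{\partial u_i}{\partial x_k}\frac{\partial x_k}{\partial x_{0j}}\,dx_{0j} = \frac{\partial u_i}{\partial x_k}\,dx_k. \tag{17}$$

(We have used the golden rule.) Making the identification $d\boldsymbol{x} \to \boldsymbol{\xi}$ we obtain (2).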

The motion of a fluid defines a family of mappings from the space of initial coordinates, $\boldsymbol{x}_0$, onto
the space of coordinates $\boldsymbol{x}$. At $t = 0$ this is just the identity map but as $t$ increases the map from $\boldsymbol{x}_0$
to $\boldsymbol{x}$ can become very complicated. Equation (16) defines the Jacobian matrix,

$$J_{ij} \equiv \frac{\partial x_i}{\partial x_{0j}}, \tag{18}$$

of the map.

With these algebraic formalities we have given an alternative derivation of (2) and, as a bonus, we
have also found a representation of the solution:

$$\boldsymbol{\xi} = J\boldsymbol{\xi}_0. \tag{19}$$

The solution above is known as Cauchy’s solution.

In (19) there is no assumption that the flow is incompressible. If the flow is incompressible (i.e.,
if $\nabla\cdot\boldsymbol{u} = 0$) then the mapping from $\boldsymbol{x}_0$ to $\boldsymbol{x}$ conserves volume. In this case, $\det J = 1$.


2  Two-dimensional  incompressible  flow 


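In the case of a two-dimensional incompressible flow there is a streamfunction $\psi = \psi(\boldsymbol{x},t)$ such that
$\boldsymbol{u} = (u,v) = (-\psi_y, \psi_x)$. In terms of $\psi$, (2) can be written as:

$$\frac{D\boldsymbol{\xi}}{Dt} = W\boldsymbol{\xi}, \qquad\text{where}\qquad W \equiv \begin{pmatrix} -\psi_{xy} & -\psi_{yy} \\ \psi_{xx} & \psi_{xy} \end{pmatrix}. \tag{20}$$

The trace of $W$ is zero and the determinant is $\det(W) = \psi_{xx}\psi_{yy} - \psi_{xy}^2$. The solution of (20) can be
written as

$$\boldsymbol{\xi} = \exp\left(\int_0^t W(t')\,dt'\right)\boldsymbol{\xi}_0 \tag{21}$$

(with the exponential understood as time-ordered when $W$ at different times does not commute).
Thus, using (19), we obtain a fundamental connection between $J(t)$ and $W(t)$:

$$J(t) = \exp\left(\int_0^t W(t')\,dt'\right). \tag{22}$$

Because $\operatorname{tr} W = 0$ it follows¹ that $\det J = 1$. This is, of course, just another way of saying that if the
flow is incompressible then the map from $\boldsymbol{x}_0$ to $\boldsymbol{x}$ is area preserving.

¹For a square matrix $M$, $\det e^{M} = e^{\operatorname{tr} M}$.

Figure 2: The sign of $\det(W) = \psi_{xx}\psi_{yy} - \psi_{xy}^2$ determines the streamline pattern. Panels, in order:
$\det W > 0$, $\det W < 0$, $\det W = 0$.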


2.1  The  steady  case 

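Because (20) is linear the solution is straightforward if the velocity field in the Lagrangian frame is
steady. Thus

$$\boldsymbol{\xi} \propto e^{\gamma t} \quad\Rightarrow\quad \gamma = \pm\sqrt{-\det W}, \tag{23}$$

where

$$\det W = \psi_{xx}\psi_{yy} - \psi_{xy}^2. \tag{24}$$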

There are three cases, which correspond to the three panels in figure 2:

Elliptic: If $\det W > 0$, then $\gamma$ is imaginary and the local streamfunction has elliptic streamlines; $\boldsymbol{\xi}$
changes periodically in time and there is no exponential stretching.

Hyperbolic: If $\det W < 0$ then $\gamma$ is real and the streamfunction is locally hyperbolic. Then, as in
lecture 1, material line elements will be stretched exponentially in one direction and compressed
in the other.

Transitional: If $\det W = 0$ then $|\boldsymbol{\xi}|$ grows linearly with time.
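
The three cases can be sketched as a small classifier. The function name `classify` and the three sample streamfunctions are illustrative assumptions, not part of the lecture; the code only evaluates $\det W = \psi_{xx}\psi_{yy} - \psi_{xy}^2$ and $\gamma = \pm\sqrt{-\det W}$.

```python
import math

def classify(psi_xx, psi_yy, psi_xy):
    """Return (label, gamma) for the local flow from the sign of det W."""
    det_w = psi_xx * psi_yy - psi_xy ** 2
    if det_w > 0.0:
        # gamma = +/- i sqrt(det W): rotation, no exponential stretching
        return "elliptic", complex(0.0, math.sqrt(det_w))
    if det_w < 0.0:
        # gamma = +/- sqrt(-det W): exponential stretching and compression
        return "hyperbolic", complex(math.sqrt(-det_w), 0.0)
    # det W = 0: linear growth of the line element
    return "transitional", complex(0.0, 0.0)

# Hypothetical local streamfunctions, given by their second derivatives:
print(classify(1.0, 1.0, 0.0))    # psi = (x^2 + y^2)/2, solid-body rotation
print(classify(1.0, -1.0, 0.0))   # psi = (x^2 - y^2)/2, pure strain
print(classify(0.0, 1.0, 0.0))    # psi = y^2/2, simple (Couette) shear
```

In practice $\det W$ computed from data is never exactly zero, so a tolerance would replace the exact comparison used in this sketch.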

Following Okubo (1970) and Weiss (1991), the sign of $\det W$ has been used to diagnose
two-dimensional turbulence simulations (e.g., McWilliams 1984). Assuming that $\det W$ is changing slowly
in the Lagrangian frame, one argues that the result in (23) applies “quasistatically”. For instance,
using simulations of two-dimensional turbulence, McWilliams shows that in the core of a strong vortex
$\psi_{xx}\psi_{yy} - \psi_{xy}^2 > 0$. The interpretation is that there is no exponential stretching of line elements in
vortex cores, which indicates that these regions are isolated patches of laminar flow. This so-called
Okubo-Weiss criterion is only a rough guide to the stretching properties of complicated flows; for a
critique and more refined results see Hua and Klein (1999).

One pleasant aspect of the steady two-dimensional case is that it is possible to explicitly calculate
the matrix exponential $J(t) = \exp(tW)$. (This is not the case in three dimensions.) Begin by noting
that

$$W^2 + (\det W)\,\mathcal{I} = 0, \tag{25}$$

where $\mathcal{I}$ is the $2\times 2$ identity matrix. The result above is easily checked by direct evaluation, but (25)
is also a consequence of $\operatorname{tr} W = 0$ and the Cayley-Hamilton theorem. When (25) is substituted into
the definition of the matrix exponential:

$$J = \exp(tW) = \mathcal{I} + tW + \frac{t^2}{2}W^2 + \frac{t^3}{6}W^3 + \cdots, \tag{26}$$

the sum collapses to

$$J = \cos\left(\sqrt{\det W}\,t\right)\mathcal{I} + \frac{\sin\left(\sqrt{\det W}\,t\right)}{\sqrt{\det W}}\,W. \tag{27}$$

We now use the result above to formulate a renovation model.
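
As a sanity check, the closed form for $\exp(tW)$ can be compared with a partial sum of the Taylor series. This is a minimal sketch in plain Python; the helper names and the three sample matrices are our own choices, and the complex square root is used so that the same formula covers $\det W < 0$ (where cosine and sine become hyperbolic).

```python
import cmath

def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def exp_closed_form(w, t):
    """exp(t*w) for a traceless 2x2 matrix w, using the collapsed series."""
    det_w = w[0][0] * w[1][1] - w[0][1] * w[1][0]
    omega = cmath.sqrt(det_w)              # imaginary when det W < 0
    cos_term = cmath.cos(omega * t)
    # sin(omega t)/omega tends to t as omega -> 0 (the transitional case)
    sin_term = t if omega == 0 else cmath.sin(omega * t) / omega
    return [[(cos_term * (1.0 if i == j else 0.0) + sin_term * w[i][j]).real
             for j in range(2)] for i in range(2)]

def exp_series(w, t, terms=40):
    """Partial sum of the Taylor series of exp(t*w)."""
    total = [[1.0, 0.0], [0.0, 1.0]]
    term = [[1.0, 0.0], [0.0, 1.0]]
    for n in range(1, terms):
        step = [[t * w[i][j] / n for j in range(2)] for i in range(2)]
        term = mat_mul(term, step)         # term = (t w)^n / n!
        for i in range(2):
            for j in range(2):
                total[i][j] += term[i][j]
    return total

def max_difference(w, t):
    """Largest entrywise difference between the two evaluations."""
    a, b = exp_closed_form(w, t), exp_series(w, t)
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))

# Hypothetical traceless matrices: elliptic, hyperbolic and transitional.
examples = {
    "elliptic": [[-0.3, -1.0], [0.5, 0.3]],
    "hyperbolic": [[0.2, 1.0], [0.5, -0.2]],
    "transitional": [[0.0, 1.0], [0.0, 0.0]],
}
for name, w in examples.items():
    print(name, max_difference(w, 1.7))
```

The printed differences should be at the level of rounding error for all three cases.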


2.2 The $\sigma$-$\zeta$ model

We construct the “$\sigma$-$\zeta$” model using the matrix equation in (20). The idea is to define an ensemble of
stretching flows in which the $2\times 2$ matrix $W$ is piecewise constant in the intervals $I_n = \{t : (n-1)\tau <
t < n\tau\}$; $\tau$ is the “decorrelation time”. We use the following representation of $W$ in the interval $I_n$:

$$W_n \equiv \frac{\zeta_n}{2}\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} + \frac{\sigma_n}{2}\begin{pmatrix} -\cos 2\theta_n & \sin 2\theta_n \\ \sin 2\theta_n & \cos 2\theta_n \end{pmatrix}. \tag{28}$$

$\zeta_n$ is the vorticity and $\sigma_n$ the strain. Isotropy is ensured by picking the random angle $0 < \theta_n < 2\pi$
from a uniform density. (We use $2\theta_n$ because the principal strain axes are at angle $\theta_n$ to the coordinate
axes, and they specify an orientation but not a direction. That is, the strain axes are like vectors
without an arrow.)

Because $W_n$ is constant in $I_n$ the calculation of stretching rates can be reduced to a product
of random matrices. The terms in the product are $\exp(\tau W_n)$ and, using (27), one can obtain this
matrix exponential analytically. There is an extensive and difficult literature devoted to calculating
the statistical properties of products of random matrices (e.g., Crisanti, Paladin & Vulpiani, 1993).
It is fortunate that we can avoid these complications by using the isotropy of the $\sigma$-$\zeta$ model to reduce
averages of matrix products to averages of scalar products.

Two important properties of $W_n$ are easily related to the vorticity and the strain:

$$\det W_n = \tfrac{1}{4}\left(\zeta_n^2 - \sigma_n^2\right), \qquad \operatorname{tr}\left(W_n^T W_n\right) = \tfrac{1}{2}\left(\zeta_n^2 + \sigma_n^2\right). \tag{29}$$

In the examples that follow we will use $\sigma$-$\zeta$ ensembles which model spatially homogeneous flows, for
which $\langle\sigma^2\rangle = \langle\zeta^2\rangle$ (by the way: this is not obvious). In this case $\langle\det W_n\rangle = 0$ and “on average” the
Okubo-Weiss criterion is zero.

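We employ (27) to obtain an explicit expression for the matrix $J_n \equiv \exp(\tau W_n)$. It turns out
that we do not need the full details: all that is required is

$$\tfrac{1}{2}\operatorname{tr}\left(J_n^T J_n\right) = 1 + \Xi(\sigma_n,\zeta_n,\tau), \tag{30}$$

where

$$\Xi(\sigma,\zeta,\tau) \equiv \frac{\sigma^2}{\zeta^2 - \sigma^2}\left[1 - \cos\left(\sqrt{\zeta^2 - \sigma^2}\,\tau\right)\right]. \tag{31}$$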
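
The identity $\tfrac12\operatorname{tr}(J_n^TJ_n) = 1 + \Xi$ can be spot-checked numerically. The sketch below builds $W_n$ from its vorticity and strain parts, exponentiates it with the closed form for a traceless $2\times2$ matrix, and compares the result with $\Xi$; the function names and sample parameter values are assumptions made for illustration.

```python
import cmath
import math

def w_matrix(sigma, zeta, theta):
    """W_n built from vorticity zeta, strain sigma and strain-axis angle theta."""
    c2, s2 = math.cos(2.0 * theta), math.sin(2.0 * theta)
    return [[-0.5 * sigma * c2, 0.5 * (sigma * s2 - zeta)],
            [0.5 * (sigma * s2 + zeta), 0.5 * sigma * c2]]

def half_trace_jtj(sigma, zeta, theta, tau):
    """(1/2) tr(J^T J) with J = exp(tau W_n) from the closed-form exponential."""
    w = w_matrix(sigma, zeta, theta)
    omega = cmath.sqrt(0.25 * (zeta ** 2 - sigma ** 2))   # sqrt(det W_n)
    cos_term = cmath.cos(omega * tau)
    sin_term = tau if omega == 0 else cmath.sin(omega * tau) / omega
    j = [[(cos_term * (1.0 if a == b else 0.0) + sin_term * w[a][b]).real
          for b in range(2)] for a in range(2)]
    return 0.5 * sum(j[a][b] ** 2 for a in range(2) for b in range(2))

def xi_factor(sigma, zeta, tau):
    """Xi(sigma, zeta, tau); the limit sigma^2 tau^2 / 2 is used when zeta^2 = sigma^2."""
    d = zeta ** 2 - sigma ** 2
    if d == 0:
        return 0.5 * (sigma * tau) ** 2
    return (sigma ** 2 / d * (1.0 - cmath.cos(cmath.sqrt(d) * tau))).real

# Hypothetical sample values (sigma, zeta, theta, tau):
cases = [(0.7, 1.3, 0.4, 2.0), (1.3, 0.7, 1.1, 1.5),
         (1.0, 1.0, 2.0, 0.8), (0.9, 0.0, 0.3, 1.0)]
for sigma, zeta, theta, tau in cases:
    print(half_trace_jtj(sigma, zeta, theta, tau), 1.0 + xi_factor(sigma, zeta, tau))
```

The two printed columns should agree, and the result does not depend on the angle `theta`, as isotropy requires.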

The “trace formula” above should be known to experts on two-dimensional stretching problems, but
I have not found (30) in the literature.

2.2.1 Stretching of squared length

Consider the first interval $I_1$, and suppose that at $t = 0$, $\boldsymbol{\xi} = \ell_0(\cos\chi, \sin\chi)$. At $t = \tau$ we have

$$\ell_1^2 = \boldsymbol{\xi}_0^T J_1^T J_1 \boldsymbol{\xi}_0. \tag{32}$$

Now we use isotropy to average (32) over the random direction $\chi$ of the element $\boldsymbol{\xi}_0$. A trivial calculation
gives

$$\left\langle (\ell_1/\ell_0)^2 \right\rangle_\chi = \tfrac{1}{2}\operatorname{tr}\left(J_1^T J_1\right). \tag{33}$$

The RHS of (33) is given explicitly in (30). We must still average over the random variables $\sigma$ and $\zeta$.
This gives

$$\left\langle (\ell_1/\ell_0)^2 \right\rangle = 1 + \iint P(\sigma,\zeta)\,\Xi(\sigma,\zeta,\tau)\,d\sigma\,d\zeta, \tag{34}$$

where $P(\sigma,\zeta)$ is the joint PDF of $\sigma$ and $\zeta$.²

We are now well on our way to computing the rate at which $\langle\ell^2\rangle$ grows with the number of renovation
cycles, $n$. The average stretching of $\ell^2$ in each $I_n$ is independent of the previous $I$’s. Thus, to compute
the growth of $\ell^2$ over $n$ renovation cycles, we can simply raise the average $\ell^2$-stretching factor in a
single $I$ to the $n$’th power:

$$\left\langle (\ell_n/\ell_0)^2 \right\rangle = \left\{1 + \iint P(\sigma,\zeta)\,\Xi(\sigma,\zeta,\tau)\,d\sigma\,d\zeta\right\}^n. \tag{35}$$

The stretching rate $\gamma_2$ is defined by

$$\gamma_2 \equiv \lim_{t\to\infty}\frac{1}{t}\ln\left[\left\langle (\ell_n/\ell_0)^2 \right\rangle^{1/2}\right]. \tag{36}$$

The notation $\gamma_2$ anticipates section 4 in which we will define a stretching rate $\gamma_p$ which measures the
growth of $\langle(\ell_n/\ell_0)^p\rangle$.

Using $n = t/\tau$, we have from (35)

$$\gamma_2 = \frac{1}{2\tau}\ln\left\{1 + \iint P(\sigma,\zeta)\,\Xi(\sigma,\zeta,\tau)\,d\sigma\,d\zeta\right\}. \tag{37}$$

To further simplify the integral above we must specify the probability density function $P(\sigma,\zeta)$
(examples follow).

2.2.2 Randomly oriented Couette flows

As a first example, suppose that in each $I_n$ the random variables $\zeta_n$ and $\sigma_n$ are independently and
identically distributed, each equal to $\pm\beta$ with equal probability. In this case

$$P(\sigma,\zeta) = \tfrac{1}{4}\left[\delta(\sigma+\beta) + \delta(\sigma-\beta)\right]\left[\delta(\zeta+\beta) + \delta(\zeta-\beta)\right]. \tag{38}$$

This ensemble is a set of randomly oriented Couette flows, such as the third case in figure 2. According
to the Okubo-Weiss criterion there should be no stretching because $\det W$ is identically zero. However,
this is wrong.

²If $\sigma$ and $\zeta$ are independent and identically distributed random variables then $P(\sigma,\zeta) = P(\sigma)P(\zeta)$.

Figure 3: A comparison of the exponent $\gamma_2$ in (40) with a simulation (the dotted curves) of the random
Couette flow. The simulation is conducted by creating random matrices according to the recipe in (28) and
(38), and then computing the matrix product. The iteration number is the number of matrices in the product.
To get reasonable agreement between the simulation and the analytic result in (40) one must ensemble average
over a large number of realizations (4000 in the figure above). The discrepancies evident at large iteration
number can be reduced by using more realizations. The figure also shows a comparison of the exponent $\gamma_0$ in
(73) with simulation.


The recipe in (38) leads to trivial calculations because $\zeta_n^2 = \sigma_n^2 \Rightarrow \Xi = \beta^2\tau^2/2$. Thus, even
without averaging over $\sigma$ and $\zeta$,

$$\tfrac{1}{2}\operatorname{tr}\left(J_n^T J_n\right) = 1 + \tfrac{1}{2}\beta^2\tau^2, \tag{39}$$

and it follows that

$$\gamma_2 = \frac{1}{2\tau}\ln\left(1 + \tfrac{1}{2}\beta^2\tau^2\right). \tag{40}$$

(See Figure 3.) The nonzero exponential stretching, which occurs even though $\det W = 0$, is due to
the realignment of a material element with respect to the direction of extension of the velocity field
which occurs at $t = n\tau$. In the limit of a very slowly changing velocity field, $\tau \to \infty$, the stretching
rate vanishes because there are fewer realignment events. This is the revenge of the Okubo-Weiss
criterion.

Figure 4: The nondimensional stretching exponents $\gamma_2/\beta$ in (41) as a function of $\beta\tau$ for various values of $q$. If
$q = 1/2$, then $\det W$ is zero identically and $\gamma_2 \to 0$ as $\tau \to \infty$. When $q$ is slightly less than 1/2, and $\tau$ is sufficiently
large, the occasional hyperbolic points can make a large contribution to the stretching exponent $\gamma_2$.
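
The random Couette ensemble lends itself to a small Monte Carlo check, in the spirit of the simulation described in the caption of Figure 3. The sketch below is not the code behind that figure: the function names, the seed, and the parameter values are our assumptions. It uses the fact that $\det W_n = 0$ implies $W_n^2 = 0$, so $\exp(\tau W_n) = \mathcal{I} + \tau W_n$ exactly, and compares the ensemble mean of $(\ell_n/\ell_0)^2$ with $(1 + \beta^2\tau^2/2)^n$.

```python
import math
import random

def couette_step(beta, tau, rng):
    """exp(tau W_n) when sigma_n and zeta_n are each +beta or -beta."""
    sigma = rng.choice((-beta, beta))
    zeta = rng.choice((-beta, beta))
    theta = rng.uniform(0.0, 2.0 * math.pi)
    c2, s2 = math.cos(2.0 * theta), math.sin(2.0 * theta)
    # det W_n = 0 here, so W_n^2 = 0 and exp(tau W_n) = I + tau W_n exactly
    return [[1.0 - 0.5 * tau * sigma * c2, 0.5 * tau * (sigma * s2 - zeta)],
            [0.5 * tau * (sigma * s2 + zeta), 1.0 + 0.5 * tau * sigma * c2]]

def mean_square_stretch(beta, tau, n_steps, n_real, seed=0):
    """Ensemble average of (l_n / l_0)^2 over n_real realizations."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_real):
        x, y = 1.0, 0.0                      # unit element along the x-axis
        for _ in range(n_steps):
            j = couette_step(beta, tau, rng)
            x, y = j[0][0] * x + j[0][1] * y, j[1][0] * x + j[1][1] * y
        total += x * x + y * y
    return total / n_real

beta, tau, n_steps = 1.0, 1.0, 4
estimate = mean_square_stretch(beta, tau, n_steps, n_real=20000)
predicted = (1.0 + 0.5 * (beta * tau) ** 2) ** n_steps
print(estimate, predicted)
print(math.log(estimate) / (2.0 * n_steps * tau),    # sampled gamma_2
      math.log(predicted) / (2.0 * n_steps * tau))   # analytic gamma_2
```

Agreement degrades as `n_steps` grows at fixed `n_real`, because the mean is increasingly dominated by rare, strongly stretched realizations.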


2.2.3 An example with $\det W \neq 0$

A more interesting stretching ensemble is defined by taking $\sigma_n$ and $\zeta_n$ to be independent and identically
distributed random variables equal to $\beta$ with probability $q$, $-\beta$ with probability $q$, or zero with
probability $1 - 2q$. With this prescription there is a hyperbolic point in $I_n$, as in the middle panel of
figure 2, with probability $2q(1-2q)$.

One can calculate $\gamma_2$ in (37) by enumeration and averaging over the nine possible pairs $(\sigma_n,\zeta_n)$.
Calculation gives

$$\gamma_2 = \frac{1}{2\tau}\ln\left\{1 + 2q^2\beta^2\tau^2 + 2q(1-2q)\left(\cosh\beta\tau - 1\right)\right\}. \tag{41}$$
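
The enumeration over the nine pairs can be sketched directly. The code below assumes the form $\Xi = \sigma^2[1-\cos(\sqrt{\zeta^2-\sigma^2}\,\tau)]/(\zeta^2-\sigma^2)$ for the single-cycle factor, with its limit $\sigma^2\tau^2/2$ when $\zeta^2 = \sigma^2$; the function names and sample values are illustrative.

```python
import cmath
import math

def xi_factor(sigma, zeta, tau):
    """Single-cycle factor Xi; complex arithmetic covers zeta^2 < sigma^2."""
    d = zeta ** 2 - sigma ** 2
    if d == 0:
        return 0.5 * (sigma * tau) ** 2
    return (sigma ** 2 / d * (1.0 - cmath.cos(cmath.sqrt(d) * tau))).real

def gamma2_enumerated(q, beta, tau):
    """gamma_2 by averaging Xi over the nine (sigma, zeta) pairs."""
    values = ((beta, q), (-beta, q), (0.0, 1.0 - 2.0 * q))
    mean_xi = sum(p_s * p_z * xi_factor(s, z, tau)
                  for s, p_s in values for z, p_z in values)
    return math.log(1.0 + mean_xi) / (2.0 * tau)

def gamma2_closed(q, beta, tau):
    """The closed-form expression for gamma_2 quoted in the text."""
    bt = beta * tau
    return math.log(1.0 + 2.0 * q * q * bt * bt
                    + 2.0 * q * (1.0 - 2.0 * q) * (math.cosh(bt) - 1.0)) / (2.0 * tau)

for q in (0.1, 0.3, 0.5):
    for tau in (0.5, 2.0, 5.0):
        print(q, tau, gamma2_enumerated(q, 1.0, tau), gamma2_closed(q, 1.0, tau))
```

With `q = 0.5` the zero values never occur and the result reduces to the random Couette case.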

Figure 4 shows the nondimensional $\gamma_2/\beta$ as a function of $\beta\tau$ for various values of $q$. From figure 4 we
conclude that while instantaneous hyperbolic points are not essential for exponential stretching, they
do help, especially if the correlation time $\tau$ is long.

2.2.4 The Batchelor and Kraichnan limits

The  calculation  of  stretching  exponents  in  this  section  does  not  follow  the  historical  path.  The 
pioneering  papers  by  Batchelor  (1959)  and  Kraichnan  (1974)  considered  limiting  cases  —  slowly 
decorrelating  in  the  case  of  Batchelor  and  rapidly  decorrelating  in  the  case  of  Kraichnan  —  in  which 
stretching  rates  can  be  calculated  approximately.  A  major  advantage  of  these  approximations  is  that 
they  work  equally  well  in  two  and  three  dimensional  space.  On  the  other  hand,  by  considering  exactly 
soluble  two-dimensional  models  we  can  extract  the  Batchelor  and  Kraichnan  limits  as  special  cases. 

Batchelor (1959) considered stretching by slowly decorrelating velocity fields. This is the limit
in which $\zeta\tau$ and $\sigma\tau$ are large. Batchelor’s main conclusion is that in this quasisteady limit the net
stretching is dominated by hyperbolic straining events. Indeed, this conclusion is illustrated by the
exact result for $\gamma_2$ which is plotted in figure 4.

Kraichnan (1974) considered the opposite limit in which $\zeta\tau$ and $\sigma\tau$ are small. In this rapidly
decorrelating limit we can simplify the exact expression in (37) by noting that $\Xi \approx (\sigma\tau)^2/2 \ll 1$.
Thus, simplifying (37), we find that the stretching rate is

$$\gamma_2 \approx \tfrac{1}{4}\langle\sigma^2\rangle\tau, \tag{42}$$

independent of the vorticity.

2.3  The  renovating  wave  model 

In this section we calculate the average growth of $\ell^2$ using the renovating wave (RW) model. It is
interesting to see how this calculation can be done without using matrix identities such as (27).
Begin by recalling the definition of the RW model. The RW streamfunction is

$$I_n = \{t : (n-1)\tau_* < t < n\tau_*\}: \qquad \psi_n = \cos\left[\cos\theta_n\,x + \sin\theta_n\,y + \varphi_n\right]. \tag{43}$$

In (43), $\theta_n$ and $\varphi_n$ are random phases and $\tau_*$ is the decorrelation time. The random phases are
reinitialized at $t = n\tau_*$ so there is a complete and sudden loss of memory at these instants. (In this
section we use the dimensionless version of the RW model; the parameter $\tau_* = \tau kU$ is the ratio of the
correlation time $\tau$ to the shear timescale $(kU)^{-1}$, where $kU$ is the maximum shear of the sinusoidal wave.)

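The renovating wave model is equivalent to the random map

$$(x_{n+1}, y_{n+1}) = (x_n, y_n) + (s_n, -c_n)\sin\left[c_n x_n + s_n y_n + \varphi_n\right]\tau_*, \tag{44}$$

where $(c_n, s_n) \equiv (\cos\theta_n, \sin\theta_n)$. The Jacobian matrix can easily be obtained by differentiation of
(44):

$$J^{(n)} = \exp\left(\tau_* W^{(n)}\right) = \begin{pmatrix} 1 + c_n s_n \tau_*\psi_n & s_n^2\tau_*\psi_n \\ -c_n^2\tau_*\psi_n & 1 - c_n s_n\tau_*\psi_n \end{pmatrix}, \tag{45}$$

with $\psi_n$ evaluated at $(x_n, y_n)$. Notice that $\det J^{(n)} = 1$: the map is area preserving.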

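Using $J^{(n)}$ we can track the stretching of an infinitesimal material element as

$$\boldsymbol{\xi}_{n+1} = J^{(n)}\boldsymbol{\xi}_n \quad\Rightarrow\quad \ell_{n+1}^2 = \boldsymbol{\xi}_{n+1}^T\boldsymbol{\xi}_{n+1} = \boldsymbol{\xi}_n^T K^{(n)}\boldsymbol{\xi}_n, \tag{46}$$

where $K^{(n)} \equiv J^{(n)T}J^{(n)}$. Explicitly:

$$K^{(n)} = \begin{pmatrix} (1 + c_n s_n\psi_n\tau_*)^2 + c_n^4\psi_n^2\tau_*^2 & (s_n^2 - c_n^2)\psi_n\tau_* + c_n s_n\psi_n^2\tau_*^2 \\ (s_n^2 - c_n^2)\psi_n\tau_* + c_n s_n\psi_n^2\tau_*^2 & (1 - c_n s_n\psi_n\tau_*)^2 + s_n^4\psi_n^2\tau_*^2 \end{pmatrix}. \tag{47}$$

To compute the stretching rate we consider an element which has length $\ell_0$ at $t = 0$. Because the
problem is isotropic, it is harmless to choose the coordinate system so that this element lies along the
$x$-axis: $\boldsymbol{\xi}_0 = \ell_0(1, 0)$. After the first iteration of the map:

$$\ell_1^2 = \boldsymbol{\xi}_0^T K^{(1)}\boldsymbol{\xi}_0 = \ell_0^2\left[(1 + c_1 s_1\psi_1\tau_*)^2 + c_1^4\psi_1^2\tau_*^2\right]. \tag{48}$$

Averaging (48) over the phases $\theta_1$ and $\varphi_1$ gives

$$\left\langle(\ell_1/\ell_0)^2\right\rangle = 1 + \frac{\tau_*^2}{4}. \tag{49}$$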

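If you are suspicious of the argument above, then you might prefer to align the initial material
element at an arbitrary angle, say $\boldsymbol{\xi}_0 = \ell_0(\cos\chi, \sin\chi)$. Repeating the calculation, we now find that

$$\ell_1^2 = \ell_0^2\left[K^{(1)}_{11}\cos^2\chi + 2K^{(1)}_{12}\cos\chi\sin\chi + K^{(1)}_{22}\sin^2\chi\right]. \tag{50}$$

Averaging (50) over $\theta_1$ and $\varphi_1$, we recover (49).

Because each $J^{(n)}$ is independent of the earlier $J$’s the average growth of $\ell^2$ is

$$\left\langle(\ell_n/\ell_0)^2\right\rangle = \left(1 + \frac{\tau_*^2}{4}\right)^n. \tag{51}$$

Using $t = n\tau_*$, (51) can be written as

$$\left\langle(\ell_n/\ell_0)^2\right\rangle^{1/2} = e^{\gamma_2 t}, \qquad \gamma_2 \equiv \frac{1}{2\tau_*}\ln\left(1 + \frac{\tau_*^2}{4}\right). \tag{52}$$

Aside from notational differences, the expression above for $\gamma_2$ is identical to (40).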
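
The phase averages in this section can be verified by direct quadrature. The sketch below assumes the entries of $K = J^TJ$ for the renovating wave map, averages $\ell_1^2/\ell_0^2$ for an element at an arbitrary angle $\chi$ over the wave direction and the wave phase on a uniform grid, and compares the result with $1 + \tau_*^2/4$. Function names and sample values are ours.

```python
import math

def k_matrix(theta, phase, tau):
    """Entries of K = J^T J; `phase` is the full argument of the cosine in psi."""
    c, s = math.cos(theta), math.sin(theta)
    psi = math.cos(phase)
    k11 = (1.0 + c * s * psi * tau) ** 2 + c ** 4 * psi ** 2 * tau ** 2
    k12 = (s * s - c * c) * psi * tau + c * s * psi ** 2 * tau ** 2
    k22 = (1.0 - c * s * psi * tau) ** 2 + s ** 4 * psi ** 2 * tau ** 2
    return k11, k12, k22

def mean_stretch(chi, tau, m=64):
    """Average of (l_1/l_0)^2 over theta and the phase on an m-by-m uniform grid."""
    cx, sx = math.cos(chi), math.sin(chi)
    total = 0.0
    for a in range(m):
        theta = 2.0 * math.pi * (a + 0.5) / m
        for b in range(m):
            phase = 2.0 * math.pi * (b + 0.5) / m
            k11, k12, k22 = k_matrix(theta, phase, tau)
            total += k11 * cx * cx + 2.0 * k12 * cx * sx + k22 * sx * sx
    return total / (m * m)

def gamma2(tau):
    """Stretching rate of the renovating wave model."""
    return math.log(1.0 + 0.25 * tau * tau) / (2.0 * tau)

for chi in (0.0, 0.7, 2.1):
    print(chi, mean_stretch(chi, 2.0), 1.0 + 0.25 * 2.0 ** 2)
print(gamma2(2.0))
```

Because the integrand is a low-order trigonometric polynomial in both angles, the uniform grid reproduces the averages to rounding error, and the result is independent of $\chi$.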

3  Amplification  of  concentration  gradients 

In this section we discuss the amplification of $\nabla c$ which occurs when a passive scalar is advected by
a random flow in two dimensions.

Back in (5) we noted that the quantity $\boldsymbol{\xi}\cdot\nabla c$ satisfies the conservation equation

$$\frac{D}{Dt}\left(\boldsymbol{\xi}\cdot\nabla c\right) = 0. \tag{53}$$

Equation (53) enables us to use our earlier results concerning the stretching of material elements to
analyze gradient amplification. In fact, using (53), we can obtain $\nabla c$ from $\boldsymbol{\xi}$. The first step is to
construct a basis by considering the following initial value problem:

$$\frac{D\boldsymbol{\xi}_k}{Dt} = (\boldsymbol{\xi}_k\cdot\nabla)\boldsymbol{u}, \quad\text{with initial conditions}\quad \boldsymbol{\xi}_1(\boldsymbol{x},0) = \hat{\boldsymbol{x}}, \quad \boldsymbol{\xi}_2(\boldsymbol{x},0) = \hat{\boldsymbol{y}}, \tag{54}$$

where the unit vectors of the coordinate system are $\hat{\boldsymbol{x}}$, $\hat{\boldsymbol{y}}$, $\hat{\boldsymbol{z}}$. As the fluid moves, the parallelogram
spanned by $\boldsymbol{\xi}_1$ and $\boldsymbol{\xi}_2$ will deform. But because $\boldsymbol{u}$ is incompressible, the area of the parallelogram is
constant and so

$$\boldsymbol{\xi}_1\times\boldsymbol{\xi}_2 = \hat{\boldsymbol{z}} \quad (\text{for all } t). \tag{55}$$

If we can solve (54) for $\boldsymbol{\xi}_1$, then we can use (53) and (55) to calculate $\boldsymbol{\xi}_2$ and $\nabla c$.

Figure 5: Numerical solution of the renovating wave model with $\tau = 2$. The initial condition is $c(x,y,0) = y$.
Already, at $t = 6\tau$, $|\nabla c|$ is greatly amplified in some regions.

Figure 6: A numerical solution of the renovating wave model with $\tau = 1$. The initial condition is $c(x,y,0) = y$.
The plots show the values of $c$ and $|\nabla c|$ along the slice $x = 0$. After 20 iterations, $|\nabla c|$ has developed
strong spatial intermittency.


As an example of this procedure, suppose that the initial condition is $c(\boldsymbol{x},0) = y$. Then it follows
from (53) that:

$$\boldsymbol{\xi}_1\cdot\nabla c = 0 \quad\text{and}\quad \boldsymbol{\xi}_2\cdot\nabla c = 1 \quad (\text{for all } t). \tag{56}$$

Using (55) and (56) we see that

$$\nabla c = \hat{\boldsymbol{z}}\times\boldsymbol{\xi}_1. \tag{57}$$

Thus, in this example, once we calculate $\boldsymbol{\xi}_1$ we obtain $\nabla c$ as a bonus.

Figure 5 displays the numerical solution for $c$ and $|\nabla c|$ after 6 iterations of the renovating wave
model. The initial condition is $c(\boldsymbol{x},0) = y$, so that $\nabla c(\boldsymbol{x},0) = \hat{\boldsymbol{y}}$; the decorrelation time is $\tau = 2$.
The field in figure 5 is obtained using a $256\times 256$ grid. To find $c$ at the grid point $\boldsymbol{x}$ at time $t = n\tau$,
one iterates the renovating wave model backwards in time till the initial location $(a,b)$ is determined,
and then $c(\boldsymbol{x},t) = b$. In parallel with this backwards iteration, $\boldsymbol{\xi}_1(\boldsymbol{x},n\tau)$ is computed by matrix
multiplication of the $J^{(n)}$ defined in (45), and then $\nabla c$ is given by (57).
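
The backward-iteration procedure can be sketched at a single point as follows. This is not the code used for the figures; the function names, the seed and the sample point are assumptions. The sketch relies on the observation that the wave phase $c_nx + s_ny + \varphi_n$ is unchanged by a step of the map (the displacement is perpendicular to the wave vector), so each step can be inverted in closed form. The result $\nabla c = \hat{\boldsymbol{z}}\times\boldsymbol{\xi}_1$ is compared with a finite-difference gradient of the backward-mapped concentration.

```python
import math
import random

def make_waves(n_cycles, seed=1):
    """Random (theta_n, phi_n) pairs, one per renovation cycle; the seed is arbitrary."""
    rng = random.Random(seed)
    return [(rng.uniform(0.0, 2.0 * math.pi), rng.uniform(0.0, 2.0 * math.pi))
            for _ in range(n_cycles)]

def step_back(x, y, theta, phi, tau):
    """Invert one step of the map; the wave phase is the same before and after."""
    c, s = math.cos(theta), math.sin(theta)
    amp = math.sin(c * x + s * y + phi) * tau
    return x - s * amp, y + c * amp

def jacobian(x, y, theta, phi, tau):
    """Jacobian of one step of the map, evaluated at the point (x, y)."""
    c, s = math.cos(theta), math.sin(theta)
    psi = math.cos(c * x + s * y + phi)
    return [[1.0 + c * s * tau * psi, s * s * tau * psi],
            [-c * c * tau * psi, 1.0 - c * s * tau * psi]]

def concentration_and_gradient(x, y, waves, tau):
    """Return c and grad c at (x, y) after len(waves) cycles, for c(x, y, 0) = y."""
    m = [[1.0, 0.0], [0.0, 1.0]]            # accumulates J^(N) J^(N-1) ... J^(1)
    for theta, phi in reversed(waves):
        x, y = step_back(x, y, theta, phi, tau)
        j = jacobian(x, y, theta, phi, tau)
        m = [[sum(m[a][k] * j[k][b] for k in range(2)) for b in range(2)]
             for a in range(2)]
    xi1 = (m[0][0], m[1][0])                # xi_1 = (J^(N) ... J^(1)) applied to x-hat
    return y, (-xi1[1], xi1[0])             # c = b, and grad c = z-hat cross xi_1

def finite_difference_gradient(x, y, waves, tau, h=1e-5):
    """Centered-difference gradient of the backward-mapped concentration."""
    cx = (concentration_and_gradient(x + h, y, waves, tau)[0]
          - concentration_and_gradient(x - h, y, waves, tau)[0]) / (2.0 * h)
    cy = (concentration_and_gradient(x, y + h, waves, tau)[0]
          - concentration_and_gradient(x, y - h, waves, tau)[0]) / (2.0 * h)
    return cx, cy

waves = make_waves(5)
conc, grad = concentration_and_gradient(0.3, -1.2, waves, 1.0)
print(conc, grad, finite_difference_gradient(0.3, -1.2, waves, 1.0))
```

Looping the same function over a grid of points would give fields of the kind described for figure 5, at a cost proportional to the number of grid points times the number of cycles.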

An important feature of stirring is the development of intermittency in the concentration gradient,
$|\nabla c|$. In figure 6 the development of intermittency is illustrated, again using the renovating wave
model. After 20 iterations there are “hotspots” in which large values of $|\nabla c|$ are concentrated.
Without diffusion, the gradient of $c$ condenses onto a fractal set as the number of iterations increases
(Varosi, Antonsen & Ott 1991).

4  Multiplicative  random  variables 

In our solution of the $\sigma$-$\zeta$ model in section 2 we used isotropy to reduce a product of random matrices
to a product of random scalars, e.g., see equations (33), (34) and the following discussion. The main
point of this section is that the statistical properties of isotropic line-element stretching are bedevilled
by the large fluctuations which are characteristic of products of random variables. Indeed, figure 6
shows that there are large fluctuations in $\ell^2 = |\boldsymbol{\xi}|^2 = |\nabla c|^2$. If one is attempting to measure the
variance dissipation, $\kappa\langle\nabla c\cdot\nabla c\rangle$, then the intermittent structure of $\nabla c$ in figure 5 might pose a
sampling problem. Imagine steering a ship through the field in figure 5 and making occasional point
measurements of $\nabla c$. If the density of measurements is too low then one might easily miss the gradient
hot-spots and so grossly underestimate $\kappa\langle\nabla c\cdot\nabla c\rangle$.

4.1  Most  probable  values  versus  mean  values 

We begin by stepping back from the stirring problem, and making some general remarks about
multiplicative random processes. Suppose that a random quantity, $X$, is formed by taking the product of
$N$ independent and identically distributed random variables

$$X = x_1 x_2\cdots x_N. \tag{58}$$

What can we say about the statistical properties of $X$?

The most nonintuitive aspect of $X$ in (58) is the crucial distinction which must be made between
the mean value of $X$ and the most probable value of $X$. As an illustration, it is useful to consider an
extreme case in which each $x_k$ in (58) is either $x_k = 0$ or $x_k = 2$ with equal probability. Then the
sample space consists of $2^N$ sequences of zeros and twos. For all but one of those sequences, $X = 0$; in
the remaining single case $X = 2^N$. Thus, the most probable (that is, most frequently occurring) value
of $X$ is

$$X_{mp} = 0. \tag{59}$$

On the other hand, the mean of $X$ is

$$\langle X\rangle = \frac{\text{sum of all the } X\text{'s from different realizations}}{\text{number of realizations}} = \frac{2^N}{2^N} = 1. \tag{60}$$

Notice that one can also calculate $\langle X\rangle$ by arguing that $\langle x_k\rangle = 1$ and, since the $x_k$’s are independent,
$\langle X\rangle = \langle x_k\rangle^N = 1$.

The example above is representative of multiplicative processes in that extreme events, although
exponentially rare if $N \gg 1$, are exponentially different from typical or most probable events. Thus,
for the product of $N$ random variables the ratio $\langle X\rangle/X_{mp}$ diverges exponentially as $N \to \infty$. On the
other hand, for the sum of $N$ random variables the most probable outcome is a good approximation
of the mean outcome. Perhaps this is why people have an intuitive appreciation of sums, but find
products confusing.

Now let us consider a more realistic example in which each $x_k$ is either $a$ or $1/a$ with probability
1/2. In this case the $p$’th moment of $X$ is

$$\langle x_k^p\rangle = \tfrac{1}{2}\left(a^p + a^{-p}\right), \quad\Rightarrow\quad \langle X^p\rangle = \left[\tfrac{1}{2}\left(a^p + a^{-p}\right)\right]^N. \tag{61}$$

Before continuing, the student will profit from showing that the most probable value of $X$ is $X_{mp} = 1$
(for $N$ even). For example, if $a = 2$ then $\langle X\rangle = (5/4)^N$, while $X_{mp} = 1$. Again, the most probable
value differs exponentially from the mean value as $N \to \infty$.
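
The distinction between the mean and the most probable value can be made concrete by exact enumeration. In the sketch below (function name and sample values are ours), $X = a^{2k-N}$ when $k$ of the $N$ factors equal $a$, and $k$ follows a binomial distribution with probability 1/2.

```python
import math

def product_stats(a, n):
    """Exact mean and most probable value of X = x_1...x_n, each x_k = a or 1/a."""
    # X = a**(2k - n) when k of the factors equal a; k is binomially distributed.
    probs = [math.comb(n, k) / 2 ** n for k in range(n + 1)]
    values = [a ** (2 * k - n) for k in range(n + 1)]
    mean = sum(p * v for p, v in zip(probs, values))
    most_probable = values[max(range(n + 1), key=lambda k: probs[k])]
    return mean, most_probable

for n in (2, 10, 20, 40):
    mean, mode = product_stats(2.0, n)
    print(n, mean, 1.25 ** n, mode)
```

For even `n` the most probable value stays at 1 while the mean grows like $(5/4)^N$, so their ratio diverges exponentially.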


4.2  The  log-normal  distribution 

Because $X_{mp}$ is so different from $\langle X\rangle$ the problem of determining $\langle X\rangle$ via Monte Carlo simulation
is difficult: one may have to exhaust nearly all of the $2^N$ cases in order to obtain a reliable estimate
of $\langle X\rangle$. This exhaustion is necessary for the first example, in which $x_k = 0$ or 2. In the example of
equation (61), provided that $a \approx 1$, we can get a pretty good estimate of $\langle X\rangle$ with less than exhaustive
enumeration of all sequences of the $x_k$’s.

Begin by noting that

$$\ln X = \ln x_1 + \ln x_2 + \cdots + \ln x_N, \tag{62}$$

and so if $\ln x_k$ has finite variance then it follows from the Central Limit Theorem (CLT) that $\Lambda \equiv \ln X$
becomes normally distributed as $N \to \infty$.

The pitfall is in concluding that all the important statistical properties of $\Lambda$, and therefore of
$X = \exp(\Lambda)$, can be calculated using the asymptotic log-normal distribution of $X$. This is not the case
because the PDF of $\Lambda$, $P(\Lambda)$, is approximated by a Gaussian only in a central scaling region in which
$|\Lambda| < cN^{1/2}$, where $c$ is some constant which depends on the PDF of $x_k$. On the other hand, a reliable
calculation of $\langle X^p\rangle = \langle\exp(p\Lambda)\rangle$ may require knowledge of the tail-structure of $P(\Lambda)$.

To illustrate these difficulties, we use the example in which $\ln x_k = \pm\ln a$ and $\langle\ln^2 x_k\rangle = \ln^2 a$.
Invoking the Central Limit Theorem, the asymptotic PDF of $\Lambda$ is therefore

$$P_{CLT}(\Lambda) = \frac{1}{\sqrt{2\pi N\ln^2 a}}\exp\left(-\frac{\Lambda^2}{2N\ln^2 a}\right). \tag{63}$$

In the central scaling region, $P(\Lambda) \approx P_{CLT}(\Lambda)$.

To determine $X_{mp}$ we can consider $\Lambda = \ln X$, which is an additive process for which the mean and
most probable coincide, so that

$$\langle\ln X\rangle = \ln X_{mp}, \quad\Rightarrow\quad X_{mp} = e^{\langle\ln X\rangle}. \tag{64}$$

Figure 7: The function $r(a,p)$ defined in (67), plotted for $p = 1, 2, 3, 4$. In order to accurately estimate
$\langle X^p\rangle$ using the CLT one must have $r \approx 1$.

In our example with $\ln x_k = \pm\ln a$, $\langle\ln X\rangle = 0$ and therefore $X_{mp} = 1$. (This is one way of solving
the problem posed in the previous section; another is to obtain the exact $P(\Lambda)$ using the binomial
density.)

With  hope  in  our  hearts,  we  now  attempt  to  recover  the  exact  result  in  (61)  by  substituting  (63) 
into 

/oo 

epAV  (A)  dA.  (65) 

-OO 

After  the  integration,  one  finds  that 

(XP)CLT  =  exP  iNP2  ln2  a/2)  ■  (66) 

To assess the error we form the ratio of the exact result to the approximation:

$$\frac{\langle X^p \rangle}{\langle X^p \rangle_{\rm CLT}} = r^N, \quad \text{where} \quad r = \frac{1}{2} \exp\left(-p^2 \ln^2 a / 2\right)\left(a^p + a^{-p}\right). \tag{67}$$

When $r(a,p)$ is close to 1, the error is tolerable in the sense that $\ln \langle X^p \rangle_{\rm CLT}$ is close to $\ln \langle X^p \rangle$.

For example, with $a = 2$, the exact result is $\langle X \rangle = (5/4)^N$ while $\langle X \rangle_{\rm CLT} = (1.27)^N$. However, the second moment, $p = 2$, is seriously in error. As a general rule, $\langle X^p \rangle_{\rm CLT}$ is a reliable estimate of $\langle X^p \rangle$ provided that $p^2 \langle \ln^2 x_k \rangle < c$, where $c$ is the constant which determines the width of the central scaling region, $|A| < cN^{1/2}$, in which $\mathcal{P}(A) \approx \mathcal{P}_{\rm CLT}(A)$. We conclude that the complete analysis of a random multiplicative quantity cannot be reduced to the Central Limit Theorem merely by taking a logarithm.
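
The size of the error in (67) is easy to tabulate. The following is a minimal sketch, assuming the two-valued example above in which each factor is $a$ or $1/a$ with equal probability; the function name `r_ratio` and the choice $N = 100$ are illustrative only.

```python
import math


def r_ratio(a, p):
    """Per-factor ratio r(a, p) in (67): exact moment over the CLT estimate."""
    exact = 0.5 * (a**p + a**(-p))                 # <x_k^p> when x_k = a or 1/a
    clt = math.exp(0.5 * p**2 * math.log(a)**2)    # per-factor growth implied by (66)
    return exact / clt


if __name__ == "__main__":
    N = 100  # number of factors in the product (illustrative value)
    for p in (1, 2, 3, 4):
        r = r_ratio(2.0, p)
        print(f"p = {p}: r = {r:.4f},  <X^p>/<X^p>_CLT = r^N = {r**N:.3e}")
```

Because the error compounds as $r^N$, even a value of $r$ a few percent below 1 leaves the CLT estimate badly wrong once $N$ is large.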


4.3  Stretching  exponents 

Equation (64) is a very important result for multiplicative random variables: to obtain the most probable value of $X$, one can exponentiate $\langle \ln X \rangle$. This explains why there is so much attention paid to $\langle \ln[\ell(t)/\ell_0] \rangle$ in the literature on random line element stretching: knowing the average of the logarithm enables one to estimate the stretching of a typical line element. Of course, the typical line element may not make a large contribution to the dissipation $\kappa \langle \nabla c' \cdot \nabla c' \rangle$. Thus our earlier focus on $\langle \ell^2 \rangle$ was not wasted, but it was not complete either.

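A good characterization of random stretching is provided by the complete set of stretching exponents. Following Drummond & Münch, we define the stretching exponents, $\gamma_p$, as

$$\gamma_p \equiv \lim_{t \to \infty} \frac{1}{p \langle \ell^p \rangle} \frac{d \langle \ell^p \rangle}{dt}, \qquad p > 0, \tag{68}$$

and

$$\gamma_0 \equiv \lim_{p \to 0} \gamma_p = \lim_{t \to \infty} \frac{d}{dt} \langle \ln \ell \rangle. \tag{69}$$

Knowing all these $\gamma$'s, the asymptotic growth of line elements is characterized by

$$\langle \ell^p \rangle^{1/p} \sim \ell_0 e^{\gamma_p t}. \tag{70}$$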


Back in section 2 we calculated only $\gamma_2$ (e.g., see (33) and the subsequent discussion). To conclude this section I will discuss the calculation of the other stretching exponents, particularly $\gamma_0$.


4.4 The stretching exponent $\gamma_0$ of the $\sigma$-$\zeta$ model

As an example of the difference between $\gamma_0$ and $\gamma_2$ we return to the $\sigma$-$\zeta$ model. In section 2 we obtained a general expression for $\gamma_2$ in (37). Now consider the problem of determining $\gamma_0$. Taking the log of (32), writing $\boldsymbol{\ell}_0 = \ell_0(\cos x, \sin x)$, and then integrating$^3$ over $x$, we have after some travail,

(ln(V4)>x  =  ^ln(l  +  |)  ,  (71) 


where  ~(a.  (.  r)  is  given  in  (31).  Averaging  over  a  and  (,  and  using  70  =  r  1  ( In  (£i/Iq)  ),  gives 

dadC  •  (72) 


70  =  hll 

The  expression  above  should  be  compared  with  that  for  72  in  (37). 


1  +  ^H(<r,C,r) 


$^3$The integral

$$\int_0^{\pi} \ln(a \pm b\cos x)\, dx = \pi \ln\left[\left(a + \sqrt{a^2 - b^2}\right)/2\right],$$

is useful.
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With  the  ensemble  of  random  Couette  flows  in  section  2.2.2  we  can  evaluate  the  integrals  in  (37) 
and  (72).  Thus,  we  find  that 


To 


1  1  (t  P2t2\  1  ,  (,  P2t2\ 

=  *'n(1  +  —)’  72  =  2?ln(1  +  — )  ■ 


(73) 


Notice that $\gamma_2 > \gamma_0$. This is an illustration of the general result that $\gamma_p$ is an increasing function of $p$ (Childress & Gilbert 1995). Figure 3 compares the expressions for $\gamma_0$ and $\gamma_2$ in (73) with simulation.


4.5  Stretching  in  one-dimension 

One-dimensional compressible velocity fields provide striking examples of the nontrivial dependence of $\gamma_p$ on $p$. We conclude this lecture with a model of random one-dimensional stretching for which the $\gamma_p$'s can be obtained analytically.

4.5.1  A  sinusoidal  velocity 

With the one-dimensional velocity $u = \sin x$, the equation governing line element stretching, (2), is

$$\ell_t + \sin x\, \ell_x = \ell \cos x, \qquad \ell(x,0) = 1. \tag{74}$$

The initial condition above is that the line elements attached to different fluid particles all have the same initial length. Because the fluid is compressible, the fluid density $\rho(x,t)$ satisfies

$$\rho_t + (\sin x\, \rho)_x = 0, \qquad \rho(x,0) = 1. \tag{75}$$

It is easy to show by substitution that the solutions of (74) and (75) are related by $\rho(x,t) = 1/\ell(x,t)$. The physical interpretation of this result should be obvious...

To solve (74), we follow the route outlined in section 1.3 by determining the mapping from the initial space, $x_0$, to the space $x(x_0,t)$. This means we solve

$$\frac{dx}{dt} = \sin x, \qquad x(0, x_0) = x_0. \tag{76}$$

Using separation of variables we find that

$$\tan(x/2) = e^{t} \tan(x_0/2), \tag{77}$$

which enables us to determine $x$ given $x_0$, or vice versa. Figure 8 shows how the mapping from $x_0$ to $x$ evolves as $t$ increases.

In this one-dimensional example, the Jacobian of the mapping is simply

$$\frac{dx}{dx_0} = \frac{1}{\cosh t - \cos x_0 \sinh t} = \cosh t + \cos x \sinh t. \tag{78}$$

It is easy to check by substitution that $\ell = dx/dx_0$ is the solution of (74).
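
The two forms of the Jacobian in (78) can also be checked numerically against a finite-difference derivative of the mapping (77). This is a small sketch; the function names and the sample values of $x_0$ and $t$ are illustrative, and the arctangent branch used restricts it to $-\pi < x_0 < \pi$.

```python
import math


def flow_map(x0, t):
    """Position at time t of the particle starting at x0, from (77); needs |x0| < pi."""
    return 2.0 * math.atan(math.exp(t) * math.tan(0.5 * x0))


def jacobian(x0, t):
    """Stretching factor dx/dx0 in terms of the initial position, first form in (78)."""
    return 1.0 / (math.cosh(t) - math.cos(x0) * math.sinh(t))


if __name__ == "__main__":
    x0, t, h = 0.5, 1.5, 1.0e-6
    x = flow_map(x0, t)
    finite_diff = (flow_map(x0 + h, t) - flow_map(x0 - h, t)) / (2.0 * h)
    second_form = math.cosh(t) + math.cos(x) * math.sinh(t)
    print("finite difference :", finite_diff)
    print("1/(cosh t - cos x0 sinh t):", jacobian(x0, t))
    print("cosh t + cos x sinh t     :", second_form)
```

Evaluating `jacobian(0.5, t)` for increasing `t` shows the element first stretching and then contracting as the particle approaches $x = \pi$.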

4.5.2  A  one-dimensional  renovating  model 

Using the previous one-dimensional illustration of the Cauchy solution, we can formulate a renovating model that illustrates some of the subtleties involved in random stretching problems. Consider an ensemble of random renovating one-dimensional velocity fields in which

$$u = \sin(x + \varphi_n) \quad \text{if} \quad (n-1)\tau < t < n\tau. \tag{79}$$




Figure 8: The left panel shows the mapping from $x_0$ to $x$ at the indicated times. The interval $0 < x_0 < \pi$ is compressed into the neighbourhood of $x = \pi$. The right panel shows $J(x_0,t)$ at the same times. Notice that an element that starts at, say, $x_0 = 1/2$ is first stretched ($J > 1$) but then ultimately compressed ($J < 1$) as the particle approaches $x = \pi$.


The random phase, $0 < \varphi_n < 2\pi$, is reset at $t = n\tau$. Notice that there is no preferred location on the x-axis; that is, the statistical properties of the process are spatially homogeneous.

Now, suppose we follow the stretching of a line element attached to a particle that moves in a particular realization of this velocity field. We denote the location of this particle at $t = n\tau$ by $a_n$, and the length of the attached line element at this time by $\ell_n$. Then, using the solution from the previous section, the stretching of the line element is given by the random product

$$\ell_n = J(a_{n-1})\, J(a_{n-2}) \cdots J(a_0)\, \ell_0, \tag{80}$$


where the Jacobian is

$$J(a) \equiv \frac{1}{\cosh\tau - \cos a \sinh\tau}. \tag{81}$$

Because the phase is reset at $t = n\tau$, each $J(a_n)$ in (80) is independent of the others. Moreover, because of spatial homogeneity, each $a_n$ is uniformly distributed with $0 < a_n < 2\pi$.

Equation (80) expresses the length of a material line element at $t = n\tau$ as a product of $n$ random numbers. Following our earlier discussion of multiplicative random variables, we first calculate $\gamma_0$ by taking the logarithm of (80):

$$\ln(\ell_n/\ell_0) = \sum_{k=0}^{n-1} \ln J(a_k). \tag{82}$$

Thus, the mean of $\ln(\ell_n/\ell_0)$ is

$$\langle \ln(\ell_n/\ell_0) \rangle = n \langle \ln J \rangle, \tag{83}$$

where, using the integral in footnote 3 with $a = \cosh\tau$ and $b = \sinh\tau$, so that $(a + \sqrt{a^2 - b^2})/2 = \cosh^2(\tau/2)$,

$$\langle \ln J \rangle = \oint \ln[J(a)]\, \frac{da}{2\pi} = -2\ln[\cosh(\tau/2)]. \tag{84}$$

Because $\langle (\ln J)^2 \rangle$ is finite, the central limit theorem applies and we conclude that as $n \to \infty$, $\ln(\ell_n/\ell_0)$ is approximately normally distributed with the mean value $n \langle \ln J \rangle$.

Figure 9: The stretching exponents $\gamma_p(\tau)$, with $p = 0, 1, \cdots, 8$, calculated using (89).

Moreover, we can conclude from the central limit theorem that the most probable value of $\ell_n/\ell_0$ is

$$(\ell_n/\ell_0)_{\rm mp} \approx e^{n \langle \ln J \rangle} = e^{\gamma_0 t}, \tag{85}$$

where, since $n = t/\tau$,

$$\gamma_0 = -2\ln[\cosh(\tau/2)]/\tau < 0. \tag{86}$$

The result in (85) is remarkable because it implies that most of the line elements in this compressible flow exponentially contract (rather than stretch) as $t \to \infty$!

Exponential contraction of most material lines is in complete disagreement with the spirit of Batchelor’s result in (1), where $\gamma > 0$. The result above, that $\gamma_0 < 0$, is a special consequence of the compressible velocity field used in (79). (For a discussion of compressible velocities in a space of arbitrary dimension, see Chertkov et al. (1998).) In any event, this example shows that one cannot take exponential stretching for granted, no matter how intuitive it seems on the basis of experiments, such as those of Welander (1955).

How is contraction in the length of most material elements compatible with conservation of the total length of the x-axis? The answer is that even though most elements become exponentially small as $t \to \infty$, a few elements become exponentially large. Thus most of the length accumulates in exponentially rare, but exponentially long, line elements. This is an elementary example of an inverse cascade, i.e., the spontaneous appearance of large-scale structures (big line elements).

To demonstrate length conservation, we can compute the mean (as opposed to the most probable) length of an element. The mean length is

$$\langle \ell_n \rangle = \langle J \rangle^n \ell_0, \tag{87}$$

where $J(a)$ is defined in (81) and

$$\langle J \rangle = \oint J(a)\, \frac{da}{2\pi} = 1. \tag{88}$$

Thus, the mean length of an element is constant, even though most elements exponentially contract.

As an exercise, I suggest showing that for integer values of $p$ the stretching exponents of this one-dimensional model are given by

$$\gamma_p = \ln\left[P_{p-1}(\cosh\tau)\right]/p\tau, \tag{89}$$

where $P_m$ is the $m$’th Legendre polynomial. Thus, in this particular example, there is a nice analytic characterization of the rate at which different moments stretch (see figure 9).
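
The averages over $a$ in this model are simple enough to check by direct quadrature. The sketch below assumes $a$ uniform on $(0, 2\pi)$ as stated above and uses a midpoint rule (very accurate for periodic integrands); the function names and the value $\tau = 1$ are illustrative. It compares the numerical moments $\langle J^p \rangle$ with the Legendre-polynomial formula (89), and $\langle \ln J \rangle$ with $-\ln[(1 + \cosh\tau)/2]$, the value given by the integral in footnote 3.

```python
import math


def mean_over_a(f, tau, m=2000):
    """Average f(J(a)) over a uniform on (0, 2*pi) using an m-point midpoint rule."""
    total = 0.0
    for k in range(m):
        a = 2.0 * math.pi * (k + 0.5) / m
        J = 1.0 / (math.cosh(tau) - math.cos(a) * math.sinh(tau))  # eq. (81)
        total += f(J)
    return total / m


def legendre(n, x):
    """Legendre polynomial P_n(x) from the standard three-term recurrence."""
    p_prev, p = 1.0, x
    if n == 0:
        return p_prev
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def gamma_p(p, tau):
    """Stretching exponent from the numerical moment <J^p>; p = 0 uses <ln J>."""
    if p == 0:
        return mean_over_a(math.log, tau) / tau
    return math.log(mean_over_a(lambda J: J**p, tau)) / (p * tau)


if __name__ == "__main__":
    tau = 1.0
    print("<J> =", mean_over_a(lambda J: J, tau))
    print("gamma_0 (quadrature) =", gamma_p(0, tau))
    print("gamma_0 (footnote 3) =", -math.log(0.5 * (1.0 + math.cosh(tau))) / tau)
    for p in range(1, 5):
        formula = math.log(legendre(p - 1, math.cosh(tau))) / (p * tau)
        print(f"p = {p}: quadrature {gamma_p(p, tau):.6f}, formula (89) {formula:.6f}")
```

The output shows $\gamma_0 < 0 = \gamma_1 < \gamma_2 < \cdots$, which is the pattern described in the text: the typical element contracts, the mean length is conserved, and the higher moments grow.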
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Lecture  4:  Anomalous  diffusion 


In this lecture we discuss stochastic models of correlated random walks. By “correlated” we mean that if a particle is headed in one direction then there is a nonzero probability that it continues in that same direction for some time, and this probability fades to zero as the time interval increases. This is, of course, the situation envisaged by Taylor (1921).

The distinction between normal and anomalous diffusion made in lecture 1 can be understood by examining the rate at which velocity correlations decrease to zero. Normal diffusion occurs if the velocity correlations decrease rapidly, while anomalous diffusion results from processes in which particles move coherently for long times with infrequent changes of direction. Roughly speaking, this distinction is quantified by the tail behaviour of the velocity autocorrelation function. For example, if the correlation function decays exponentially then there is normal diffusion, whereas if the correlation function decays algebraically then there is the possibility of anomalous diffusion.

The definition of anomalous diffusion is based only on the behaviour of the second moment, $\langle x^2 \rangle$. But we usually want to know more about the distribution of a tracer than simply the second moment. In the case of normal diffusion, detailed information concerning the tracer distribution is obtained by solving the diffusion equation

$$c_t = D c_{xx}. \tag{1}$$

Can we obtain continuum models, analogous to (1), which provide the same detailed information for an anomalously diffusing tracer? The main goal of this lecture is to develop partial differential equation models which can be used for this purpose.


1  Superdiffusion  and  subdiffusion 


1.1  Taylor’s  formula  and  long  tails 

Yet again we recall Taylor’s formula, which relates the growth of position variance to an integral of the Lagrangian velocity autocorrelation function, corr(t),

$$\frac{d\langle x^2 \rangle}{dt} = 2 \int_0^t {\rm corr}(t')\, dt'. \tag{2}$$

In order to obtain $\langle x^2 \rangle$ we must integrate (2). Standard manipulations turn the resulting double integral of corr(t) into a single integral

$$\langle x^2 \rangle = 2 \int_0^t (t - t')\, {\rm corr}(t')\, dt'. \tag{3}$$

The result (3), which is not in Taylor’s original paper, will prove to be very useful.

We usually have in mind situations in which corr(t) decreases to zero as $t \to \infty$ so that the integrals in (2) and (3) converge to nonzero values. An example is the renovating wave model, with its “triangular” correlation function, from lecture 2. Later in this lecture I will introduce the telegraph model which has an exponentially decaying correlation function, ${\rm corr}(t) = U^2 \exp(-2\alpha t)$. These are both examples in which correlations decrease very rapidly so that normal diffusion occurs. But now consider the possibility that corr(t) decreases so slowly that the integrals in (3) diverge.

Suppose, for instance, that as $t \to \infty$, ${\rm corr}(t) \sim t^{-\eta}$ with $0 < \eta < 1$. Even though the diffusivity no longer exists, it still follows from (3) that

$$\langle x^2 \rangle \sim t^{2-\eta}. \tag{4}$$

In this case there is superdiffusion: the variance of the particle displacement grows faster than linearly with time because $2 - \eta > 1$.

Taylor’s formula also contains the possibility of subdiffusion. This case is subtle because, like the example of the sea-surface mentioned in lecture 1, it requires that the integral defining $D$ is zero. But suppose additionally that the remaining integral in (3) diverges. This can happen if ${\rm corr}(t) \sim ct^{-\eta}$ with $1 < \eta < 2$. The condition that $1 < \eta$ ensures that $\int_0^\infty {\rm corr}(t')\, dt'$ converges (to zero). The second inequality, $\eta < 2$, ensures that $\int_0^t t'\, {\rm corr}(t')\, dt'$ diverges. Using (3), we again find the scaling law in (4). However, this time, because $2 - \eta < 1$, there is subdiffusion.

At first glance the two possibilities above appear as unlikely exceptions to the more natural cases in which both integrals in (3) converge. However, there are examples in fluid mechanics in which either subdiffusion or superdiffusion is observed experimentally or computationally. Thus (4) cannot be dismissed as an unlikely pathology.
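
The scaling law (4) can be illustrated by evaluating the single integral (3) numerically. The sketch below is only an illustration: the two correlation functions, $e^{-t}$ and $(1+t)^{-1/2}$, are assumed model forms chosen for convenience, and the function names, the trapezoid rule and the fitting interval are likewise arbitrary choices.

```python
import math


def variance(t, corr, m=200000):
    """<x^2>(t) from (3): 2 * integral_0^t (t - s) corr(s) ds, by the trapezoid rule."""
    h = t / m
    total = 0.0
    for k in range(m + 1):
        s = k * h
        weight = 0.5 if k in (0, m) else 1.0
        total += weight * (t - s) * corr(s)
    return 2.0 * h * total


def growth_exponent(corr, t1=1.0e3, t2=1.0e4):
    """Log-log slope of <x^2>(t) between t1 and t2."""
    return math.log(variance(t2, corr) / variance(t1, corr)) / math.log(t2 / t1)


if __name__ == "__main__":
    eta = 0.5
    print("exponential corr:", growth_exponent(lambda s: math.exp(-s)))
    print("algebraic corr  :", growth_exponent(lambda s: (1.0 + s) ** (-eta)))
```

The exponentially decaying correlation gives a slope close to 1 (normal diffusion), while the slowly decaying one gives a slope close to $2 - \eta = 1.5$; the small excess over 1.5 comes from subdominant terms that are still noticeable at these times.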

1.2  The  Texas  experiments 

An experiment illustrating anomalous diffusion has been conducted in Swinney’s laboratory at the University of Texas; see Solomon, Weeks & Swinney (1994) and Weeks, Urbach & Swinney (1996). These investigators study the dispersion of particles in an almost two-dimensional flow in an annular tank (see figure 1). The tank is rotating at about 1 or 2 Hertz and the bottom is sloped to simulate the $\beta$-effect. Because of the rapid rotation the flow is quasi-two-dimensional.

The  flow  is  forced  by  pumping  fluid  through  the  tank.  If  the  pumping  rate  is 
sufficiently  large  then  this  azimuthal  flow  is  unstable  to  a  vortex-forming  instability. 
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Figure  1:  A  sketch  of  the  rotating  annulus;  the  rotation  rate  is  about  1Hz.  Flow  is  forced  by 
pumping  water  in  through  the  ring  of  holes  marked  by  I  and  withdrawing  the  same  volume 
through the other ring marked O. As a consequence of the strong Coriolis force acting on
the  radial  flow  between  these  concentric  rings  there  is  an  azimuthal  flow  around  the  annulus. 
The  experiment  is  viewed  from  above  using  a  video  camera.  Figure  courtesy  of  Eric  Weeks. 


A  typical  flow  pattern  in  the  rotating  frame  is  shown  in  figure  2.  Evident  also  in  this 
figure  is  the  azimuthal  jet  which  runs  all  the  way  around  the  tank.  The  vortex  pattern 
can  be  perturbed  experimentally  by  making  the  strength  of  the  pumping  depend  on 
azimuth.  In  this  fashion,  one  can  drive  an  unsteady  flow  and  observe  chaotic  particle 
trajectories. 

Automated image processing techniques are used to follow nearly neutrally buoyant tracer particles suspended in such a flow. Typical particle trajectories are shown in figure 3. Particles within a vortex remain trapped for a very long time (they stick). Particles in the azimuthal jet experience prolonged flights around the circumference of the tank. Because the vortex pattern is not perfectly stationary, particles alternate, apparently randomly, between flying in jets and sticking in vortices.

One  can  change  the  pattern  of  jets  and  vortices  by  altering  the  diameters  of  the 
circular  barriers  which  confine  the  flow.  Thus  it  is  possible  to  create  a  flow  with  two 
oppositely  directed  jets  separated  by  a  vortex  chain.  In  this  case  the  dispersion  process 
is  more  symmetric  than  in  figure  4  because  the  flights  go  in  both  directions  around  the 
tank. 

During  a  flight  the  angular  displacement  is  proportional  to  the  time  elapsed  since 
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Figure  2:  Streaks  formed  by  100s  trajectories  of  12  particles  reveal  four  vortices.  Weeks  et 
al.  show  that  the  motion  of  these  coherent  vortices  is  chaotic.  That  is,  a  velocity  spectrum, 
obtained by measuring velocity with a hot film probe, is broad band. Figure courtesy of Eric Weeks.


Figure  3:  Trajectories  of  three  tracer  particles  in  the  flow  shown  in  figure  2.  The  beginning 
of each trajectory is indicated by a triangle and the end by a circle. In (b) the particle
spends  most  of  its  life  trapped  in  a  single  vortex.  However,  this  vortex  wobbles  erratically 
because  the  flow  is  chaotic.  In  parts  (a)  and  (c)  the  particles  experience  several  episodes  of 
trapping  within  a  vortex  and  flight  around  the  tank  in  the  jet.  Figure  courtesy  of  Eric  Weeks. 




Figure 4: Angular displacement, $\theta(t)$, for the trajectories in figure 3. There is an obvious distinction between the flights and the sticking events. The small oscillations during the sticking events correspond to particle motion within a vortex. Figure courtesy of Eric Weeks.
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the  flight  began: 


$$\Delta\theta \approx Ut. \tag{5}$$

The displacement, $\Delta\theta$, is essentially zero during a sticking event (see figure 4).

The experiments show that the dispersion of an ensemble of particles is superdiffusive. That is

$$\langle (\theta - \langle\theta\rangle)^2 \rangle \sim t^{\gamma} \tag{6}$$

where $\gamma > 1$; typical values are $\gamma \approx 1.4$ to $1.7$ depending on the experimental configuration. (It is also possible to observe normal diffusion, $\gamma = 1$, by strongly forcing the flow and breaking the azimuthal symmetry of the forcing.)

To characterize the motion Solomon et al. used sticking and flying PDFs:

$$\mathcal{P}_F(a)\, da = \text{Probability that a flight has a duration} \in (a, a + da). \tag{7}$$

Later in this lecture we will refer to $a$ as the “lifetime” of a particle in the flying or sticking state. We figuratively speak of a particle being born into the flying state and moving coherently for a lifetime $a$ so that the total angular displacement during the flight is $\Delta\theta = Ua$.

The PDF $\mathcal{P}_F$ is normalized by $\int_0^\infty \mathcal{P}_F(a)\, da = 1$ and

$$\tau_F \equiv \int_0^\infty a\, \mathcal{P}_F(a)\, da = \text{average duration of a flight}. \tag{8}$$

The PDF of sticking times, $\mathcal{P}_S(a)$, and the average sticking time, $\tau_S$, are defined analogously.

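Experiments show that as $a \to \infty$, $\mathcal{P}_F$ and $\mathcal{P}_S$ have algebraically decaying tails:

$$\mathcal{P}_F(a) \sim a^{-\mu_F}, \qquad \mathcal{P}_S(a) \sim a^{-\mu_S}, \tag{9}$$

with

$$2 < (\mu_F, \mu_S) < 3. \tag{10}$$

Because of this slow algebraic decay the variance of the lifetimes, defined by

$$\langle a^2 \rangle_{F,S} \equiv \int_0^\infty a^2\, \mathcal{P}_{F,S}(a)\, da, \tag{11}$$

diverges.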

The divergence of $\langle a^2 \rangle_F$ is significant because invoking Einstein’s formula for the diffusivity

$$D \sim \frac{\langle (\Delta\theta)^2 \rangle}{\tau} \tag{12}$$

and using $\Delta\theta = Ua$, we conclude that $D \propto \langle (\Delta\theta)^2 \rangle = U^2 \langle a^2 \rangle_F = \infty$. The divergence of $D$ is symptomatic of superdiffusion.

Notice that the denominator $\tau$ in (12) is related to the average flying and sticking times, $\tau_F$ and $\tau_S$, which are both finite. Thus, in the Texas experiments, we can say that anomalous diffusion occurs because the numerator of (12) is divergent. In other cases it is the denominator which causes trouble.

The Texas experiments show that anomalous diffusion occurs in realistic and geophysically relevant systems. Several theoretical questions suggest themselves. How do the algebraic tails of $\mathcal{P}_S$ and $\mathcal{P}_F$ arise, and can we make a microscopic model which exhibits this phenomenon? Can we relate the exponents $\gamma$, $\mu_F$ and $\mu_S$? (From section 4, the answer to the last question is $\gamma = 4 - \mu_F$.)
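
To see concretely why a tail exponent between 2 and 3 gives a finite mean lifetime but a divergent second moment, one can truncate the integrals in (8) and (11) at a large cutoff and watch what happens as the cutoff grows. The sketch below assumes a pure power law, $\mathcal{P}(a) = (\mu - 1)a^{-\mu}$ for $a \geq 1$; this normalized form, the value $\mu = 2.5$ and the function name are illustrative choices, not the measured PDFs.

```python
import math


def truncated_moment(k, mu, cutoff, m=20000):
    """Integral of a^k P(a) over 1 < a < cutoff, with P(a) = (mu - 1) a^(-mu)."""
    h = math.log(cutoff) / m
    total = 0.0
    for j in range(m):
        a = math.exp((j + 0.5) * h)                      # midpoint rule in u = ln(a)
        total += a**k * (mu - 1.0) * a**(-mu) * a * h    # da = a du
    return total


if __name__ == "__main__":
    mu = 2.5
    for cutoff in (1.0e2, 1.0e4, 1.0e6):
        mean = truncated_moment(1, mu, cutoff)
        second = truncated_moment(2, mu, cutoff)
        print(f"cutoff = {cutoff:.0e}: <a> = {mean:.4f}, <a^2> = {second:.1f}")
```

The truncated mean settles down as the cutoff increases, whereas the truncated second moment keeps growing like the cutoff raised to the power $3 - \mu$.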

2  The  telegraph  model 

The key issue raised by anomalous diffusion is the decay of velocity correlations. Thus our goal is to formulate models for which we can explicitly calculate velocity statistics and understand the decay of correlations. Our first attempt is not very ambitious: we begin with the telegraph model, which is the simplest example of a continuous-time correlated random walk.

2.1  The  Lagrangian  formulation  of  the  telegraph  model 

In a telegraph process the velocity of particle $n$, denoted by $u_n(t)$, can have only one of two possible values, $+U$ and $-U$. The velocity of each particle, $u_n(t)$, flips randomly back and forth between $\pm U$ with a transition probability $\alpha$ per time. This means that in a time $dt$ a fraction $\alpha\, dt$ of the ensemble switches velocity. Because the transition rate, $\alpha$, is constant we can say that a particle has no “memory” of when it first arrived in its present state. Thus this telegraph model is Markovian.

We  refer  to  the  prescription  for  constructing  a  telegraph  process  as  model  A.  There 
is  a  variant,  model  B,  discussed  below. 

With the prescription above, the velocity of a particle is a discontinuous function of time as shown in figure 5. The correlation function and the diffusivity are

$${\rm corr}(t) = U^2 e^{-2\alpha|t|}, \qquad D = \int_0^\infty {\rm corr}(t)\, dt = \frac{U^2}{2\alpha}, \qquad \text{(model A)}. \tag{13}$$

Notice that ${\rm corr}_{tt}$ is infinite at $t = 0$; this is because the acceleration is infinite at the discontinuities in figure 5.

Figure 5: An example of a telegraph time series, and the telegraph correlation function, $C(t) = U^2 \exp[-2\alpha|t|]$.

To obtain (13), return to the definition of the correlation function

$${\rm corr}(t) = \frac{1}{N} \sum_{n=1}^{N} u_n(0)\, u_n(t), \tag{14}$$

where $N$ is the total number of particles in the ensemble. Suppose that at $t$ the sum on the right hand side has $P(t)$ positive terms, all equal to $U^2$, and $N - P(t)$ negative terms, all equal to $-U^2$. Thus

$${\rm corr}(t) = \frac{U^2}{N} \left[2P(t) - N\right]. \tag{15}$$

In a time $dt$, $P\alpha\, dt$ of the positive terms become negative and $(N - P)\alpha\, dt$ of the negative terms become positive. Thus, at $t + dt$,

$$P(t + dt) = P(t)\,(1 - 2\alpha\, dt) + N\alpha\, dt, \tag{16}$$

and the analog of (15) is:

$${\rm corr}(t + dt) = \frac{U^2}{N} \left[2P(t)(1 - 2\alpha\, dt) + 2N\alpha\, dt - N\right]. \tag{17}$$

Taking the limit $dt \to 0$ in (17) gives ${\rm corr}_t = -2\alpha\, {\rm corr}$; the solution of this differential equation is (13).
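
The bookkeeping in (15) and (16) can be marched forward directly with a small time step and compared with the exponential in (13). This is a minimal sketch: the function name, the step size and the parameter values are illustrative, and $P$ is treated as a continuous (ensemble-averaged) quantity rather than an integer count.

```python
import math


def telegraph_corr(alpha, U, t_end, dt=1.0e-4, N=1.0):
    """March (16) forward from P(0) = N and return corr(t_end) computed from (15)."""
    P = N  # at t = 0 every term u_n(0) u_n(0) = U^2 is positive
    steps = int(round(t_end / dt))
    for _ in range(steps):
        P = P * (1.0 - 2.0 * alpha * dt) + N * alpha * dt   # eq. (16)
    return U**2 / N * (2.0 * P - N)                          # eq. (15)


if __name__ == "__main__":
    alpha, U, t = 0.5, 1.0, 1.0
    print("stepped   :", telegraph_corr(alpha, U, t))
    print("exact (13):", U**2 * math.exp(-2.0 * alpha * t))
```

The two numbers agree to within the first-order error of the time stepping, which shrinks in proportion to `dt`.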

An alternative telegraph process (model B) is constructed by imagining that at random instants each particle flips a coin. The flipping rate is $\alpha$ so that in a time $dt$, there are $N\alpha\, dt$ coin flips. After each flip, the velocity is $+U$ if there is a head and $-U$ if a tail. With this prescription, a particle will change direction on average once out of every two tosses. On the other tosses the particle continues in the same direction and the result is as if nothing happened. Thus with model B we simply replace $\alpha$ by $\alpha/2$ in our earlier calculations and consequently the correlation function and diffusivity are

$${\rm corr}(t) = U^2 e^{-\alpha|t|}, \qquad \int_0^\infty {\rm corr}(t)\, dt = \frac{U^2}{\alpha}, \qquad \text{(model B)}. \tag{18}$$

The difference between model A and model B is trivial. However, the distinction between the two cases will plague us later.

If we are searching for a model of anomalous diffusion then the telegraph model is a disappointment: the exponentially decaying correlation function ensures that $D$ is finite and that the displacement variance ultimately grows diffusively. We continue our investigation of the telegraph model in order to better understand “ultimately” and because in section 4 the telegraph model is used as the foundation of more elaborate models which do show anomalous diffusion.

2.2  The  Eulerian  formulation  of  the  telegraph  model 

Now we ignore the Lagrangian information contained in the correlation function (14) and instead we give an Eulerian formulation of the telegraph process. Let $R(x,t)$ denote the density (particles/length) of particles moving to the right with velocity $+U$ and $L(x,t)$ denote the density of left-moving particles with velocity $-U$. The coupled conservation laws are

$$R_t + UR_x = \alpha(L - R), \qquad L_t - UL_x = \alpha(R - L). \tag{19}$$

These equations should be self-evident...

We can put (19) into a revealing alternative form by defining the total concentration, $C(x,t)$, and the flux, $F(x,t)$, as

$$C \equiv R + L, \qquad F \equiv U(R - L). \tag{20}$$

In terms of these new variables the model is

$$C_t + F_x = 0, \qquad F_t + 2\alpha F = -U^2 C_x. \tag{21}$$

The first equation is conservation of particles and the second equation is the flux-gradient relation.

Notice that in (21) Fick’s law does not apply: the flux $F$ is not instantaneously related to the gradient $C_x$. Equation (21b), which might be called Cattaneo’s law (see the 1989 review by Joseph and Preziosi), can be solved as a first-order differential equation for $F(x,t)$. Thus, the flux at $x$ is expressed as a weighted integral over the past history of the gradient at $x$:

$$F(x,t) = -U^2 \int_{-\infty}^{t} e^{-2\alpha(t - t')}\, C_x(x,t')\, dt'. \tag{22}$$

The flux has a “fading memory” of the gradient and the exponential in (22) is the fading factor which strongly weights the most recent values of the gradient.

Next, if we eliminate $F$ from (21), we obtain

$$C_{tt} + 2\alpha C_t - U^2 C_{xx} = 0. \tag{23}$$

This is the telegraph equation; the diffusion equation is obtained only as an approximation which applies to the low frequency and wavenumber components of $C(x,t)$. On these large and slowly evolving scales one can neglect the term $C_{tt}$ in (23) and so obtain the approximation

$$C_t \approx DC_{xx}, \qquad D \equiv \frac{U^2}{2\alpha}. \tag{24}$$

The diffusivity $D$ in (24) was anticipated in (13).
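
One way to see which scales are "large and slowly evolving" is to substitute a Fourier mode, $C \propto \exp(ikx + st)$, into the telegraph equation (23), which gives $s^2 + 2\alpha s + U^2k^2 = 0$. The sketch below, with illustrative function names and parameter values, compares the roots of this quadratic with the diffusive decay rate $-Dk^2$ implied by (24).

```python
import cmath


def growth_rates(k, alpha, U):
    """Roots s of s^2 + 2*alpha*s + U^2 k^2 = 0, from (23) with C ~ exp(i k x + s t)."""
    disc = cmath.sqrt(alpha**2 - (U * k) ** 2)
    return -alpha + disc, -alpha - disc


def diffusive_rate(k, alpha, U):
    """Decay rate -D k^2 predicted by (24), with D = U^2 / (2 alpha)."""
    return -(U**2 / (2.0 * alpha)) * k**2


if __name__ == "__main__":
    alpha, U = 1.0, 1.0
    for k in (0.01, 0.1, 1.0, 10.0):
        slow, fast = growth_rates(k, alpha, U)
        print(f"k = {k:5.2f}: s = {slow:.5f}, {fast:.5f};  -D k^2 = {diffusive_rate(k, alpha, U):.5f}")
```

For $Uk \ll \alpha$ the slowly decaying root is close to $-Dk^2$, while for $Uk > \alpha$ the roots become complex: these short waves propagate at a speed approaching $U$ and are damped at the rate $\alpha$, which the diffusion equation cannot describe.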

Figure  6  shows  a  numerical  solution  of  (23)  starting  with  an  initial  condition  of 
the  form 


R{x,0)  =  L{x,0)  =  e-*2^ .  (25) 

At  small  times  the  density  C  develops  a  double  peaked  structure  as  the  left  and  right 
going  populations  separate.  This  behaviour  is  transient,  and  at  longer  times  the  central 
part  of  the  concentration  relaxes  to  the  well-known  Gaussian  solution  of  the  diffusion 
equation. 

According  to  (23)  the  disturbance  travels  at  a  finite  speed:  these  are  the  “heat 
waves”  discussed  by  Joseph  and  Preziosi  (1989),  and  also  evident  in  figure  6.  The 
approximate  diffusion  equation  (24)  makes  the  unrealistic  prediction  that  disturbances 
are  propagated  at  infinite  speed.  This  unphysical  consequence  of  the  diffusion  equation 
motivated  Cattaneo  to  propose  (21b)  as  an  alternative  to  Fick’s  law. 

These considerations show that one cannot blithely assert the validity of the diffusion equation (24) as an exact description of dispersion. The diffusion equation applies only as an approximate description of low frequencies and long wavelengths.

2.3  Discretization  of  the  telegraph  model 

This  section  is  a  digression.  Read  on  if  you  want  to  learn  how  to  solve  the  telegraph 
equation  using  a  simple  numerical  scheme.  (This  is  how  I  drew  figure  6.) 

We reformulate the telegraph model in terms of discrete variables: the x-axis is divided into segments of length $\delta x$ separated by “scattering sites” at $x_n = n\delta x$. Time is also discretized in units of $\delta t$ so that $t = T\delta t$ where $T$ is an integer, $T = 0, 1, 2, \cdots$. The walkers move along the x-axis with a velocity that is either $+\delta x/\delta t$ or $-\delta x/\delta t$. In terms of the continuous model in (19),

$$U = \frac{\delta x}{\delta t}. \tag{26}$$

When a walker reaches the scattering site at $x_n = n\delta x$ he is “backwards scattered” or reflected with probability $b$ and “forward scattered” or transmitted with probability $1 - b$. Because the probability of a change in direction, $b$, is the same for left as for right moving walkers there is no mean velocity along the line.

Figure 7: A steady state with constant flux, $f = U(1 - b)(R - L)$, passing through site $n$.

Let $R_n(T)\delta x$ be the number of right walkers in the segment $n$, $n\delta x < x < (n+1)\delta x$. The number of left walkers, $L_n(T)\delta x$, is defined analogously. $R_n$ and $L_n$ are the discrete analogs of the continuous densities used in (19).

With  these  rules  and  definitions,  the  discrete  evolution  equations  for  the  ensemble 
are 

$$R_n(T) = (1 - b)R_{n-1}(T - 1) + bL_n(T - 1), \tag{27}$$

$$L_n(T) = (1 - b)L_{n+1}(T - 1) + bR_n(T - 1). \tag{28}$$

For  instance,  in  the  first  equation  above,  the  number  of  right  movers  in  segment  n  is 
equal  to  the  number  in  segment  n  —  1  at  the  previous  time  that  successfully  passed 
through  scattering  site  n,  plus  the  number  of  left  movers  previously  in  segment  n  that 
were  reflected  at  this  same  site.  Figure  6  shows  the  result  of  iterating  the  discrete 
system  above. 
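
The iteration of (27) and (28) can be sketched in a few lines of Python. This is only an illustration, not the code used for figure 6: the function names, the periodic wrap-around at the ends of the array, and the parameter values are all assumptions made here for convenience.

```python
def step(R, L, b):
    """Advance the right- and left-walker densities by one time step.

    Implements (27)-(28) on a periodic array of segments (the periodic
    wrap-around is an assumption made here to avoid boundary bookkeeping).
    """
    n = len(R)
    R_new = [(1 - b) * R[i - 1] + b * L[i] for i in range(n)]
    L_new = [(1 - b) * L[(i + 1) % n] + b * R[i] for i in range(n)]
    return R_new, L_new


def iterate(R, L, b, steps):
    """Apply `step` repeatedly and return the final densities."""
    for _ in range(steps):
        R, L = step(R, L, b)
    return R, L


# Release all walkers in the middle segment, half moving each way.
n_seg, b = 201, 0.1
R0 = [0.0] * n_seg
L0 = [0.0] * n_seg
R0[n_seg // 2] = 0.5
L0[n_seg // 2] = 0.5
R, L = iterate(R0, L0, b, steps=80)
total = sum(R) + sum(L)   # walkers are neither created nor destroyed
```

With 80 steps the walkers cannot reach the ends of the 201-segment array, so the periodic wrap-around plays no role in this example. The sharp fronts at the leading edges are the walkers that have never been scattered; their number decays like $(1 - b)^T$.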

One exact solution of the difference equations above is

$$R_n = L_n = L_{n+1} = R_{n+1}. \tag{29}$$

This solution is steady: $R_n(T) = R_n(T - 1)$. In fact, (29) is the discrete analog of the equilibrium solution of the diffusion equation. The distribution of walkers is spatially uniform with equal numbers going left and right in each interval and there are no concentration gradients. An individual walker is moving to and fro, but the ensemble is in steady state.

Next, we consider the constant-flux solution. In figure 7, $R$ right walkers impinge on site $n$ from the left and $L$ left walkers impinge on $n$ from the right. In steady state it must be that on the left of $n$ there are $bR + (1 - b)L$ left walkers moving away, while to the right there are $bL + (1 - b)R$ right walkers moving away. Thus the flux to the right of the site is

$$f = U\left[(1 - b)R + bL\right] - UL = U(1 - b)(R - L), \tag{30}$$

where we have used $U = \delta x/\delta t$. Because there is a steady state, calculating the flux to the left of the site gives exactly the same result and so there is a nondivergent flux of walkers along the line.

Next, we can calculate the concentration difference across the site in figure 7. To the right of the site the total density of walkers is

$$c^+ = L + bL + (1 - b)R, \tag{31}$$

while on the left the density is

$$c^- = R + bR + (1 - b)L. \tag{32}$$

Combining (31) and (32) we have for the concentration jump across the site

$$\delta c = c^+ - c^- = 2b(L - R). \tag{33}$$

Thus, using (30), the flux-gradient relation in steady state is

$$f = -D\,\frac{\delta c}{\delta x}, \qquad D \equiv \frac{1 - b}{2b}\,\frac{\delta x^2}{\delta t}. \tag{34}$$


Does it seem obvious to you that the diffusivity should diverge as $b \to 0$? If you think of the diffusivity as the area under the correlation function then this divergence should be intuitive. It is an instructive exercise to obtain $D$ in (34b) using Taylor’s formula. (Hint: consider $N \gg 1$ right walkers which initially set out together. At $t = T\,\delta t$, after $T$ encounters with scattering sites, how many of these walkers have changed direction an even number of times, and how many odd?)

Comparing the equation above with our earlier expression for the diffusivity, $D$ in (13) and (24), we conclude that

$$\alpha\,\delta t = \frac{b}{1 - b}. \tag{35}$$

Thus, with (26) and (35), we can express the parameters of the discrete model, $(\delta x, \delta t, b)$, in terms of the parameters characterizing the continuous model, $U$ and $\alpha$.
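
As a sketch of how this translation might be coded, the helper below picks a time step and returns the matching $\delta x$ and $b$. Two things are assumptions here rather than statements from the text above: the relation is taken to be $\alpha\,\delta t = b/(1 - b)$, and the continuous diffusivity is taken to be $U^2/2\alpha$ (the expression in (13) and (24) is not reproduced in this section). The function names are hypothetical.

```python
def discrete_parameters(U, alpha, dt):
    """Return (dx, b) for the discrete model, given U, alpha and a time step dt.

    Uses dx = U*dt, from (26), and the assumed relation
    alpha*dt = b/(1 - b), i.e. b = alpha*dt/(1 + alpha*dt).
    """
    dx = U * dt
    b = alpha * dt / (1.0 + alpha * dt)
    return dx, b


def discrete_diffusivity(dx, dt, b):
    """Diffusivity of the discrete model, D = (1 - b)/(2 b) * dx**2/dt, as in (34)."""
    return (1.0 - b) / (2.0 * b) * dx ** 2 / dt


# Example: the discrete diffusivity should not depend on the choice of dt.
dx, b = discrete_parameters(U=2.0, alpha=0.5, dt=0.01)
D = discrete_diffusivity(dx, 0.01, b)   # compare with U**2 / (2 * alpha)
```

Under these assumptions the result is independent of $\delta t$, which is a useful consistency check on any implementation of the scheme.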


3  Age-stratified  populations 

The telegraph model from section 2 is Markovian. This means that each particle has a constant probability per unit time, $\alpha$, of switching direction. Thus, no matter how long a particle has been moving to the right (say), its probability of switching direction in the next $dt$ is always $\alpha\,dt$. Consequently an exponentially decreasing number of particles move coherently for long intervals and the telegraph model in (19) does not exhibit anomalous diffusion.

A satisfactory description of anomalous diffusion demands a non-Markovian model in which particles have some memory of their past motion. To obtain superdiffusion it is necessary that a right-moving particle is less and less likely to change direction as it spends more and more time moving right$^1$.

Such  memory  effects  are  implicit  in  the  models  discussed  by  Weeks  et  al  (1996),  and 
in  several  of  the  articles  in  the  conference  proceedings  edited  by  Schlesinger,  Zaslavsky 
&  Frisch  (1994).  The  stochastic  models  discussed  in  Schlesinger  et  al.  draw  heavily 
on  statistical  physics.  In  this  lecture  we  are  going  to  develop  the  theory  from  scratch 
using  a  formalism  which  is  accessible  to  people  whose  background  is  in  fluid  mechanics. 
The  climb  begins  with  an  excursion  into  the  theory  of  age-stratified  populations. 

Consider a population of items with finite lifetimes and a death rate which depends on age, $a$: for example, light bulbs in a large building, or the population of the United States. At time $t$ the age structure of the population is characterized by a density function $f(a,t)$, for which $f(a, t)\,da$ is the number of items whose age is between $a$ and $a + da$. In terms of $f$, the total number of items in the population, $N(t)$, and the average age, $\bar a(t)$, are given by

$$N(t) = \int_0^\infty f(a,t)\,da, \qquad \bar a(t) = N^{-1}\int_0^\infty a\,f(a,t)\,da. \tag{36}$$

The density function evolves according to

$$f_t + f_a + \alpha f = 0, \tag{37}$$

where $\alpha(a)$ is the death rate. The term $f_a$ in (37) says that the population translates along the age-axis at a rate of one year every year. To completely specify the problem we must supply an initial condition, and also a boundary condition at $a = 0$. The boundary condition at $a = 0$ has an obvious interpretation:

$$f(0, t) = \text{the birth (or replacement) rate}. \tag{38}$$

In  the  case  of  a  population  of  people,  the  boundary  condition  above  is  a  flux  of  babies 
into  the  system. 

The Markovian limit is the special case in which $\alpha$ is independent of $a$. This model of $\alpha$ is unrealistic for both light bulbs and people, though it might apply to a population of radioactive molecules. The Markovian case is very simple because one can integrate (37) over $a$ and obtain a closed equation for $N(t)$:

$$N_t + \alpha N = f(0,t). \tag{39}$$

Thus if $\alpha$ is constant and we need only the total number of functional items at $t$ then we do not need to solve partial differential equations and deal with the age structure of the population.

1 A popular metaphor for the Markovian case is radioactive decay: a molecule has a constant probability per unit time of decaying. As a metaphor for the non-Markovian case, imagine entering an enormous maze and then trying to find your way back to the entrance. The longer one has wandered, the less the chance of stumbling on the exit in the next $dt$.

3.1 The steady-state solution

As a first illustrative example, suppose that the replacement rate is adjusted so that $N$ is constant. (Janitors replace light bulbs as soon as they burn out.) In this case the equilibrium solution of (37) is

$$f(a) = \frac{N\,\Psi(a)}{\tau}, \tag{40}$$

where

$$\Psi(a) \equiv \exp\left(-\int_0^a \alpha(a')\,da'\right), \qquad \tau \equiv \int_0^\infty \Psi(a)\,da. \tag{41}$$

The function $\Psi$, and its integral $\tau$, will occur frequently in the sequel. Notice that the replacement rate is $f(0) = N/\tau$ and this suggests that $\tau$ should be the average lifetime of an item. On the other hand, $\tau$ will not usually be equal to $\bar a$ in (36). I suggest brooding on this “paradox” and, as an exercise, see if you can resolve this confusion to your satisfaction by the end of this section.

In (41) we assume that the death rate $\alpha(a)$ is such that as $a \to \infty$, $\Psi(a) \to 0$ fast enough to ensure that $\tau$ is finite. For instance, if $\alpha$ is constant (this is the Markovian case) then $\Psi(a) = \exp(-\alpha a)$ and $\tau = \bar a = 1/\alpha$.

If the death rate $\alpha$ decreases with age then the average lifetime $\tau$ might not be finite. For example, consider the specific model

$$\alpha = \frac{\nu}{\theta + a}. \tag{42}$$

Provided that $\nu > 1$ then the integral of $\Psi(a)$ converges and $\tau = \theta/(\nu - 1)$.
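
For readers who want the intermediate step, the survival function and its integral for the model (42) follow directly from the definitions in (41). The short calculation below is a worked check, using only the symbols already defined:

```latex
\begin{align*}
\Psi(a) &= \exp\left(-\int_0^a \frac{\nu\,da'}{\theta + a'}\right)
         = \left(\frac{\theta}{\theta + a}\right)^{\nu}, \\
\tau &= \int_0^\infty \left(\frac{\theta}{\theta + a}\right)^{\nu} da
      = \left[\frac{\theta^{\nu}(\theta + a)^{1-\nu}}{1 - \nu}\right]_0^\infty
      = \frac{\theta}{\nu - 1} \qquad (\nu > 1).
\end{align*}
```

The upper limit contributes nothing only when $\nu > 1$, which is where the convergence condition comes from.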

If $\nu < 1$ then $\tau = \infty$ and there is no steady solution. To understand this curious result we must solve an initial value problem (see appendix A). Here we just remark that if $\nu < 1$ then the average lifetime of a bulb is infinite. Detailed solution of the initial value problem in appendix A shows that in this case the replacement rate is $f(0, t) \propto t^{\nu-1}$. That is, the total number of new bulbs which have been installed at time $t$ grows like $t^{\nu} \ll t$. The hypothetical manufacturer of lightbulbs with $\nu < 1$ is threatened with bankruptcy: sales decrease with time, even though every bulb eventually fails.


3.2  A  cohort  of  babies 

Imagine a cohort of babies leaving the maternity ward together, or a box of new lightbulbs shipped fresh from the factory. These items will function for varying amounts of time, and so we can speak of the PDF of lifetimes. We denote this PDF by $\mathcal{P}(a)$, and our goal is to relate $\mathcal{P}(a)$ to the death rate $\alpha(a)$.

Consider a group of $N$ items which all start with $a = 0$ at $t = 0$. What fraction of this cohort survives at $t > 0$? The surviving fraction is also the fraction of lifetimes longer than $t$ and so

$$\text{surviving fraction at } t = \int_t^\infty \mathcal{P}(a)\,da. \tag{43}$$

To calculate the surviving fraction, we solve (37) with the initial and boundary conditions

$$f(a, 0) = N\delta(a), \qquad f(0,t) = 0. \tag{44}$$

The solution of (37) and (44) is

$$f(a,t) = N\,\Psi(t)\,\delta(a - t), \tag{45}$$

where $\Psi$ is defined in (41). Thus $\Psi(t)$ is the fraction of the cohort which is still alive at time $t$; we refer to $\Psi$ as the survival function.

It now follows from (43) that the PDF of lifetimes of new items is

$$\mathcal{P}(a) = -\Psi_a = \alpha\Psi. \tag{46}$$


The average lifetime, $\tau$, is given by the equivalent expressions:

$$\tau = \int_0^\infty a\,\mathcal{P}(a)\,da = \int_0^\infty \Psi(a)\,da. \tag{47}$$

Thus, as was suggested in the discussion following (41), to keep a population in equilibrium the replacement rate is equal to the size of the population, $N$, divided by the average lifetime of new items, $\tau$.
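
The relation between the survival function and the PDF of lifetimes also gives a simple way to draw random lifetimes: if $u$ is uniform on $(0, 1]$ then the solution of $\Psi(a) = u$ is a lifetime with PDF $\alpha\Psi$. The sketch below does this for the model death rate (42); the function names, the seed, and the parameter values $\nu = 3.5$, $\theta = 2$ are illustrative assumptions.

```python
import random


def survival(a, nu, theta):
    """Survival function Psi(a) = (1 + a/theta)**(-nu) for alpha = nu/(theta + a)."""
    return (1.0 + a / theta) ** (-nu)


def sample_lifetime(nu, theta, rng=random):
    """Draw one lifetime by inverting Psi: solve Psi(a) = u for u uniform in (0, 1]."""
    u = 1.0 - rng.random()          # in (0, 1], avoids raising zero to a negative power
    return theta * (u ** (-1.0 / nu) - 1.0)


rng = random.Random(0)
nu, theta = 3.5, 2.0
lifetimes = [sample_lifetime(nu, theta, rng) for _ in range(200_000)]
sample_mean = sum(lifetimes) / len(lifetimes)
tau = theta / (nu - 1.0)            # exact mean lifetime for this model
frac_longer_than_1 = sum(1 for a in lifetimes if a > 1.0) / len(lifetimes)
```

The sample mean should approach $\tau = \theta/(\nu - 1)$ and the fraction of lifetimes longer than $t$ should approach $\Psi(t)$. If $\nu$ is close to 1 the convergence of the sample mean is very slow, because the variance of the lifetime is then infinite.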


3.3  Extinction  of  a  population 

As a final example, suppose that at $t = 0$ we have the steady-state lightbulb population in (40). If the janitors then go on strike, so that bulbs burn out without replacement, then how many bulbs are still operating at $t > 0$? In this example we must solve (37) with the initial and boundary conditions that

$$f(a, 0) = N\tau^{-1}\Psi(a), \qquad f(0,t) = 0. \tag{48}$$

The solution is

$$f(a,t) = H(a - t)\,N\tau^{-1}\Psi(a), \tag{49}$$

where $H(a - t)$ is the step function. Thus the fraction of surviving bulbs at $t$ is

$$\Theta(t) \equiv \tau^{-1}\int_t^\infty \Psi(a)\,da = \tau^{-1}\int_t^\infty (a - t)\,\mathcal{P}(a)\,da. \tag{50}$$

Using the specific model of $\alpha$ in (42), the surviving fraction is

$$\Theta(t) = \left(1 + \theta^{-1}t\right)^{1-\nu}. \tag{51}$$

$\Theta(t)$ is the most slowly decaying function we have seen so far: as $t \to \infty$, $\Theta(t) \gg \Psi(t) \gg \mathcal{P}(t)$. This model may be relevant to the very slow extinction of professors once the supply of graduate students is cut off.
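
To make the ordering of the three functions concrete, here is a small sketch that evaluates them for the model (42). The closed forms used are $\Psi = (1 + a/\theta)^{-\nu}$, the lifetime PDF $\alpha\Psi$, and the strike survival fraction $(1 + t/\theta)^{1-\nu}$; the function names and the sample values are assumptions made for illustration.

```python
def death_rate(a, nu, theta):
    """alpha(a) in (42)."""
    return nu / (theta + a)


def survival(a, nu, theta):
    """Psi(a) from (41), evaluated for the model (42)."""
    return (1.0 + a / theta) ** (-nu)


def lifetime_pdf(a, nu, theta):
    """PDF of lifetimes of new items, alpha * Psi, as in (46)."""
    return death_rate(a, nu, theta) * survival(a, nu, theta)


def strike_survivors(t, nu, theta):
    """Fraction of the steady-state population still alive at t, as in (51); needs nu > 1."""
    return (1.0 + t / theta) ** (1.0 - nu)


t, nu, theta = 100.0, 1.5, 1.0
values = (strike_survivors(t, nu, theta), survival(t, nu, theta), lifetime_pdf(t, nu, theta))
```

Each function is smaller than the previous one by a factor of order $t/\theta$ at large $t$, which is the sense in which the steady-state population dies out more slowly than a cohort of new items.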

Comparing the results in sections 3.1 and 3.2, we see that the steady state population in section 3.1 contains more long-lived items than are in a cohort of new items in section 3.2. This means that the average lifetime of the light bulbs currently operating in the Empire State building is longer than the average lifetime of bulbs shipped from the factory. The reason is obvious: items with brief lifetimes fail quickly, and will likely be replaced with items whose lifetime is closer to the mean. Thus, fragile individuals are underrepresented in an operational population.


4  The  generalized  telegraph  model 

4.1  Formulation 

Using the machinery from the previous section we now construct a generalization of the telegraph model which exhibits anomalous diffusion. In this generalization particles switch randomly between moving with $u(t) = +U$, $u(t) = 0$ and $u(t) = -U$. The transition probabilities between these states are functions of the time since the last transition. In other words, each particle carries an “age”, $a$, which is the time elapsed since the particle transitioned into its present state. We denote the density of right-moving particles at $(x, t)$, with age $a$, by $\mathcal{R}(a, t, x)$. For left-moving particles the density is $\mathcal{L}(a,t,x)$, and for the stationary particles the density is $\mathcal{S}(a,t,x)$. We refer to the left- and right-movers collectively as “flying particles” while the stationary particles are “stickers”.

The flying particles satisfy the conservation laws

$$\mathcal{R}_t + \mathcal{R}_a + U\mathcal{R}_x + \alpha_F\mathcal{R} = 0, \qquad \mathcal{L}_t + \mathcal{L}_a - U\mathcal{L}_x + \alpha_F\mathcal{L} = 0, \tag{52}$$

while the sticking particles have

$$\mathcal{S}_t + \mathcal{S}_a + \alpha_S\mathcal{S} = 0. \tag{53}$$

The death rates of flying and sticking particles, $\alpha_F$ and $\alpha_S$ respectively, are functions of age $a$; it is through this device that particles have a memory of their previous history. The price we pay for this non-Markovian memory is that there are now three independent variables, $(a, t, x)$.

Stationary particles are born when left- and right-moving particles die. And, conversely, when a stationary particle dies it is reborn as either a left-moving particle or a right-moving particle with equal probability. Notice that in order for a right-moving particle to become a left-moving particle it must pass through the intermediate state with $u = 0$. These karmic rules are enforced by boundary conditions at $a = 0$:

$$\mathcal{L}(0, t, x) = \mathcal{R}(0, t, x) = \frac{1}{2}\int_0^\infty \alpha_S(a)\,\mathcal{S}(a,t,x)\,da, \tag{54}$$

and

$$\mathcal{S}(0,t,x) = \int_0^\infty \alpha_F(a)\left[\mathcal{L}(a,t,x) + \mathcal{R}(a,t,x)\right]da. \tag{55}$$


Trajectories  of  particles  moving  with  this  generalized  telegraph  process  are  shown  in 
figure  8. 

The model we have formulated here is a generalization of the telegraph model in two ways. First, there are three states: left, right and stationary. This minor embellishment is motivated by the Texas experiments in which trapping in a vortex corresponds to the stationary particles. The nontrivial generalization is the introduction of the age variable used to capture memory effects. As an exercise, the student can show that if $\alpha_F$ and $\alpha_S$ are independent of $a$ then one can easily integrate over $a$ and reduce (52) through (55) to a three-state telegraph model. (This exercise shows how the boundary condition at $a = 0$ works.) As a sequel to this exercise, discuss the limit $\alpha_S \to \infty$ and show that in this limit one obtains effectively a two-state telegraph model. Are you surprised that the diffusivity is given by (18)? That is, why do we recover model B, rather than model A, when the sojourns in the intermediate state $u = 0$ are very brief?

In order to model slowly fading velocity correlations and anomalous diffusion we use

$$\alpha_F(a) = \frac{\nu_F}{\theta_F + a}, \qquad \alpha_S(a) = \frac{\nu_S}{\theta_S + a}. \tag{56}$$

With the form above, the transition rates decrease as particles age. Numerical simulations of the three-state model using the transition rates in (56) show that many particles move in the same direction for a long time (see figure 9).

The main point of (56) is that if $a \gg 1$ then the transition rates $\alpha_F$ and $\alpha_S$ are proportional to $a^{-1}$. This inverse dependence on age ensures that the flying and sticking PDFs, $\mathcal{P}_F$ and $\mathcal{P}_S$ in (7), decay algebraically. Thus (56) incorporates important experimental information into the model$^2$. One can make a dimensional argument in support of (56): $\alpha_F$ and $\alpha_S$ have the dimensions of inverse time. If the only time-scale relevant for long-lived particles is the particle age, $a$, then it follows that $\alpha_F$ and $\alpha_S$ are inversely proportional to $a$. We now show that the parameters $\nu_F$ and $\nu_S$ are easily related to the experimentally measured exponents $\mu_F$ and $\mu_S$ in (9).

2 As far as scaling exponents are concerned, only the $a \gg 1$ structure of $\alpha_F$ and $\alpha_S$ matters. We use the specific functional form in (56) for simplicity.

Figure 8: Trajectories of particles in the generalized telegraph random process. All particles are released from $x = 0$.
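
A particle-by-particle simulation of the three-state process is straightforward, and the following is a minimal sketch of one way to do it. It is not the code behind figures 8 and 9. The assumptions are: durations in each state are drawn by inverting the survival function $(1 + a/\theta)^{-\nu}$ implied by (56); every particle starts at $x = 0$ as a newly born sticker (not with the equilibrium age distribution); and the values $\nu_F = \nu_S = 1.35$, $\theta_F = \theta_S = 1$, $U = 1$ are illustrative choices.

```python
import random


def sample_duration(nu, theta, rng):
    """Lifetime in a state with death rate nu/(theta + a), drawn by inverting Psi."""
    u = 1.0 - rng.random()          # uniform in (0, 1]
    return theta * (u ** (-1.0 / nu) - 1.0)


def simulate_particle(t_end, U, nu_F, theta_F, nu_S, theta_S, rng):
    """Position at t_end of one particle released at x = 0 in the sticking state."""
    t, x = 0.0, 0.0
    flying = False
    while True:
        if flying:
            # A dying sticker is reborn as a left or right mover with equal probability.
            u_now = U if rng.random() < 0.5 else -U
            dur = sample_duration(nu_F, theta_F, rng)
        else:
            u_now = 0.0
            dur = sample_duration(nu_S, theta_S, rng)
        remaining = t_end - t
        if dur >= remaining:
            return x + u_now * remaining
        x += u_now * dur
        t += dur
        flying = not flying         # flyers become stickers and vice versa


rng = random.Random(1)
U, t_end = 1.0, 200.0
xs = [simulate_particle(t_end, U, 1.35, 1.0, 1.35, 1.0, rng) for _ in range(2000)]
mean_square = sum(x * x for x in xs) / len(xs)
```

Repeating the run for several values of `t_end` and plotting `mean_square` against time on logarithmic axes gives a rough estimate of the growth exponent, though with power-law tails the estimate converges slowly and is sensitive to the few longest flights.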

4.2  The  equilibrium  solution 

The system (52) through (55) has a solution which is homogeneous ($\partial_x = 0$) and steady ($\partial_t = 0$). This equilibrium solution is

$$\mathcal{R}(a,t,x) = \mathcal{L}(a,t,x) = r\,\Psi_F(a), \qquad \mathcal{S}(a,t,x) = 2r\,\Psi_S(a), \tag{57}$$

where $\Psi_{F,S}(a)$ is

$$\Psi_{F,S}(a) \equiv \exp\left(-\int_0^a \alpha_{F,S}(a')\,da'\right). \tag{58}$$

The constant $r$ in (57) is the transition rate between the different states; $r$ is determined by the normalization condition:

$$N = 2r\,(\tau_S + \tau_F), \qquad \tau_{F,S} \equiv \int_0^\infty \Psi_{F,S}(a)\,da. \tag{59}$$

Figure 9: A simulation with $N = 10^4$ particles; $\alpha_F(a) = \alpha_S(a) \sim 1.35/a$. Upper panels: PDFs as a function of age and position show that there are many particles that either stick or move at a constant velocity for nearly the whole simulation. Center panels: PDFs of the position of particles develop tails larger than Gaussians as time goes on. Lower panels: PDFs of the age of particles have a spike at large times, because there is a fraction of particles that never die.

We can use the results from section 3 to interpret $\tau_F$ and $\tau_S$ as the average lifetimes in the flying and sticking states respectively. Using (46), the PDF of lifetimes in those states is given by

$$\mathcal{P}_{F,S} = \alpha_{F,S}\Psi_{F,S}. \tag{60}$$

Using the expression in (56) for $\alpha_{F,S}$, we see that as $a \to \infty$, the survival functions decay algebraically with $\Psi_{F,S} \sim a^{-\nu_{F,S}}$, and so $\mathcal{P}_{F,S} \sim a^{-\nu_{F,S}-1}$. It follows that the exponents $\mu_F$ and $\mu_S$ defined in (9) are related to $\nu_F$ and $\nu_S$ by

$$\mu_{F,S} = \nu_{F,S} + 1. \tag{61}$$

We can summarize our arguments to this point by observing that the experiments provide the flying velocity, $U$, the average lifetimes in the flying and sticking states, $\tau_{F,S}$, and the exponents $\mu_{F,S}$. These five experimental data determine the five parameters in the generalized telegraph model, namely $(U, \theta_{F,S}, \nu_{F,S})$.
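
The bookkeeping in this paragraph can be written as a small helper. The sketch assumes that the exponent relation reads $\mu = \nu + 1$ and that, for the functional form (56), the mean lifetime is $\tau = \theta/(\nu - 1)$ as in the discussion of (42). The function name and the numbers in the example are hypothetical and are not experimental values.

```python
def model_parameters(mu, tau):
    """Return (nu, theta) for one state from its measured exponent mu and mean lifetime tau.

    Assumes nu = mu - 1 and tau = theta/(nu - 1); the latter needs nu > 1.
    """
    nu = mu - 1.0
    if nu <= 1.0:
        raise ValueError("mean lifetime is infinite unless mu > 2")
    theta = tau * (nu - 1.0)
    return nu, theta


# Illustrative (made-up) inputs for the flying state.
nu_F, theta_F = model_parameters(mu=2.35, tau=2.0)
```

Applying the same function to the sticking state, together with the measured $U$, fixes all five model parameters.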

4.3  Formulation  of  the  initial  value  problem 

Now that we have determined the model parameters using experimental constraints it is time to do some mathematics and use the model to predict the exponent $\gamma$ in (6). The simplest initial value problem we can consider is (52) through (55) with

$$\left[\mathcal{R}(a,0,x), \mathcal{S}(a,0,x), \mathcal{L}(a,0,x)\right] = r\left[\Psi_F(a), 2\Psi_S(a), \Psi_F(a)\right]\delta(x). \tag{62}$$

The constant $r$ is given in (57). Thus, the initial population has an equilibrium distribution of ages and is released at $x = 0$. Because of the symmetry between left- and right-moving particles

$$\mathcal{R}(a,t,x) = \mathcal{L}(a,t,-x), \qquad \mathcal{S}(a,t,x) = \mathcal{S}(a,t,-x). \tag{63}$$

Equation (63) greatly simplifies subsequent algebra.

One technical point (which I confess confuses me) is using the equilibrium age distribution as the initial condition in (62). This choice leads to simple calculations below. And perhaps the gross details of the dispersion process, such as the exponent $\gamma$, are independent of the initial distribution of ages? As an exercise I suggest solving the initial value problem using other initial conditions, e.g., $\mathcal{R}(a, 0, x) = \delta(a)\delta(x)$, etcetera. Are there any significant differences in the $t \to \infty$ structure of the solution?

Our strategy will be to obtain a closed hierarchy of spatial moments by multiplying the conservation laws (52) and (53) by $x^n$ and integrating over $x$. It is possible to solve the first few members of the hierarchy and show that if $\alpha_{F,S}$ has the form in (56) with $1 < \nu_F < 2$ then as $t \to \infty$

$$\langle x^2\rangle = \int_0^\infty\int_{-\infty}^{\infty} x^2\left[\mathcal{R}(a,t,x) + \mathcal{S}(a,t,x) + \mathcal{L}(a,t,x)\right]dx\,da \propto t^{3-\nu_F}. \tag{64}$$

Before entering this calculation, we give a simple argument which suggests how the anomalous exponent $3 - \nu_F > 1$ arises in (64).

The variance $\langle x^2\rangle$ in (64) can alternatively be written as

$$\langle x^2\rangle = \frac{1}{N}\sum_{n=1}^{N} x_n^2. \tag{65}$$

At time $t > 0$ some of the $N$ particles will have moved coherently with unchanging velocity (either $+U$ or $-U$) ever since $t = 0$; half of these particles will be at $x = Ut$ and the other half at $x = -Ut$. These “coherent particles” each contribute a term $U^2t^2$ to the sum on the right hand side of (65). The number of coherent flying particles is just $\Theta(t)N$ where $\Theta(t)$ is given by (51) with $\nu$ replaced by $\nu_F$. Thus, because every term in the sum in (65) is positive, one has

$$\langle x^2\rangle > \Theta(t)\,U^2t^2 \sim t^{3-\nu_F}. \tag{66}$$

The inequality (66) has teeth only if $3 - \nu_F > 1$: then we learn that the coherent particles alone produce a superdiffusive contribution to the variance.

The argument above may suggest to you that superdiffusion is due solely to the few extreme particles which move without changes in direction. This is an overstatement: the lower bound in (66) is generously less than the exact result for $\langle x^2\rangle$ which we obtain in the next section. Thus “nearly-coherent” particles, meaning particles which change direction only once or twice, also make a large contribution to the sum in (65). This is an essential point, because in their analysis of the experiments Solomon et al. discarded all coherent particles from the data set$^3$. Thus the exponent measured by Solomon et al. reflects only the contribution of nearly coherent particles.

3 This drastic procedure is necessary because some fraction of the experimental particles are in integrable regions and will fly forever. Retaining all these particles will ultimately lead to the trivial ballistic exponent $\gamma = 2$.

4.4 Solution of the initial value problem

This is a dry section which contains the details of the analytic calculation of $\langle x^2\rangle$. The main point of interest here is that a lot of the algebra can be avoided by proving (75) below. (I suggest this as an exercise.)

The spatial moments are defined by

$$\left[\mathcal{R}_n(a,t), \mathcal{S}_n(a,t), \mathcal{L}_n(a,t)\right] \equiv \int_{-\infty}^{\infty} x^n\left[\mathcal{R}(a,t,x), \mathcal{S}(a,t,x), \mathcal{L}(a,t,x)\right]dx. \tag{67}$$

Because of the symmetry in (63)

$$\mathcal{R}_n(a,t) = (-1)^n\mathcal{L}_n(a,t), \qquad \mathcal{S}_n(a,t) = 0 \ \text{ if } n \text{ is odd}. \tag{68}$$

The result above allows us to work exclusively with $\mathcal{R}_n$ and $\mathcal{S}_n$ while retaining full information about the distribution. Using the symmetry, the variance can be written as

$$\langle x^2\rangle = \int_0^\infty \left(2\mathcal{R}_2 + \mathcal{S}_2\right)da. \tag{69}$$

The zeroth moment of (52) through (55), with the initial condition in (62), is

$$\left[\mathcal{R}_0(a,t), \mathcal{S}_0(a,t), \mathcal{L}_0(a,t)\right] = r\left[\Psi_F(a), 2\Psi_S(a), \Psi_F(a)\right]. \tag{70}$$


That  is,  the  zeroth  moment  is  just  the  equilibrium  solution.  (This  is  why  using  the 
equilibrium  age  distribution  as  the  initial  condition  is  so  convenient.) 

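Using (68), the first spatial moment is $\mathcal{S}_1 = 0$, $\mathcal{L}_1(a,t) = -\mathcal{R}_1(a,t)$ and

$$\mathcal{R}_{1t} + \mathcal{R}_{1a} + \alpha_F\mathcal{R}_1 = U r\,\Psi_F, \qquad \mathcal{R}_1(0,t) = 0, \qquad \mathcal{R}_1(a,0) = 0. \tag{71}$$

The solution of the initial value problem (71) is

$$\mathcal{R}_1(a,t) = U r\,\Psi_F(a)\min(a,t). \tag{72}$$

The second moment equations are $\mathcal{L}_2 = \mathcal{R}_2$ and

$$\mathcal{R}_{2t} + \mathcal{R}_{2a} + \alpha_F\mathcal{R}_2 = 2U\mathcal{R}_1, \qquad \mathcal{S}_{2t} + \mathcal{S}_{2a} + \alpha_S\mathcal{S}_2 = 0, \tag{73}$$

with the $a = 0$ boundary condition that

$$2\mathcal{R}_2(0,t) = \int_0^\infty \alpha_S(a)\,\mathcal{S}_2(a,t)\,da, \qquad \mathcal{S}_2(0,t) = 2\int_0^\infty \alpha_F(a)\,\mathcal{R}_2(a,t)\,da. \tag{74}$$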

To obtain the variance in (69) we do not need the complete solution of (73) and (74). Instead, after some judicious integration over $a$, one finds that

$$\frac{d}{dt}\langle x^2\rangle = 4U\int_0^\infty \mathcal{R}_1(a,t)\,da. \tag{75}$$

Substituting (72) into the result above we obtain

$$\frac{d}{dt}\langle x^2\rangle = 4U^2 r\int_0^\infty \min(a,t)\,\Psi_F(a)\,da = 4U^2 r\left[\int_0^t a\,\Psi_F(a)\,da + t\int_t^\infty \Psi_F(a)\,da\right]. \tag{76}$$

If the right hand side of (76) approaches a constant as $t \to \infty$ then the variance grows diffusively. Otherwise there is anomalous diffusion.
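
For readers who want to check their answer to the exercise, the “judicious integration” leading to (75) can be sketched as follows. Differentiate the variance in (69) with respect to time, substitute the second-moment equations (73), and integrate the age derivatives, assuming that the second moments vanish as $a \to \infty$:

```latex
\begin{align*}
\frac{d}{dt}\langle x^2\rangle
 &= \int_0^\infty \left(2\mathcal{R}_{2t} + \mathcal{S}_{2t}\right) da \\
 &= \int_0^\infty \left(-2\mathcal{R}_{2a} - 2\alpha_F\mathcal{R}_2 + 4U\mathcal{R}_1
      - \mathcal{S}_{2a} - \alpha_S\mathcal{S}_2\right) da \\
 &= \Big[2\mathcal{R}_2(0,t) - \int_0^\infty \alpha_S\mathcal{S}_2\,da\Big]
  + \Big[\mathcal{S}_2(0,t) - 2\int_0^\infty \alpha_F\mathcal{R}_2\,da\Big]
  + 4U\int_0^\infty \mathcal{R}_1\,da \\
 &= 4U\int_0^\infty \mathcal{R}_1(a,t)\,da .
\end{align*}
```

Both square brackets vanish by the boundary conditions (74): what is lost by death in one state reappears as birth in the other, so only the advective source term survives.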

With (76) in hand, one can easily determine if particular models of $\alpha_F$ and $\Psi_F$ lead to anomalous diffusion. For example, with the model in (56), evaluating the integrals in (76) gives a pleasant exact solution

$$\frac{d}{dt}\langle x^2\rangle = 4U^2 r\,\theta_F^2\left[\frac{(\hat t + 1)^{2-\nu_F}}{(2 - \nu_F)(\nu_F - 1)} + \frac{1}{(\nu_F - 1)(\nu_F - 2)}\right], \tag{77}$$

where $\hat t \equiv t/\theta_F$.

The asymptotic behaviour at large time depends crucially on $\nu_F$. If $\nu_F > 2$ then there is normal diffusion:

$$\frac{d}{dt}\langle x^2\rangle = \frac{4U^2 r\,\theta_F^2}{(\nu_F - 1)(\nu_F - 2)} + O\!\left(t^{2-\nu_F}\right). \tag{78}$$

If $1 < \nu_F < 2$, there is superdiffusion:

$$\frac{d}{dt}\langle x^2\rangle = \frac{4U^2 r\,\theta_F^{\nu_F}\,t^{2-\nu_F}}{(2 - \nu_F)(\nu_F - 1)} + O(1). \tag{79}$$

(At $\nu_F = 2$ there is a logarithmic term.)

Notice the minor role of $\alpha_S(a)$ in the solution above: if $\nu_S > 1$, so that the mean sticking time is finite, then the parameters $\nu_S$ and $\theta_S$ occur only in $r$.
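
A numerical cross-check of the exact solution is easy to set up. The sketch below assumes that (76) reads $4U^2 r\,[\int_0^t a\Psi_F\,da + t\int_t^\infty \Psi_F\,da]$ and that (77) is equivalent to $4U^2 r\,\theta_F^2\,[(1 + \hat t)^{2-\nu_F} - 1]/[(2 - \nu_F)(\nu_F - 1)]$ with $\hat t = t/\theta_F$. The function names, the midpoint rule, and the default values $U = r = 1$ are choices made here for illustration.

```python
def psi_F(a, nu, theta):
    """Survival function of flying particles for alpha_F = nu/(theta + a)."""
    return (1.0 + a / theta) ** (-nu)


def growth_rate_closed_form(t, nu, theta, U=1.0, r=1.0):
    """Closed form for d<x^2>/dt, as read from (77); valid for nu > 1, nu != 2."""
    that = t / theta
    bracket = ((1.0 + that) ** (2.0 - nu) - 1.0) / ((2.0 - nu) * (nu - 1.0))
    return 4.0 * U ** 2 * r * theta ** 2 * bracket


def growth_rate_quadrature(t, nu, theta, U=1.0, r=1.0, n=20000):
    """4 U^2 r [ int_0^t a Psi_F da + t int_t^inf Psi_F da ].

    The first integral uses the midpoint rule; the tail integral is done in closed form.
    """
    h = t / n
    inner = sum((i + 0.5) * h * psi_F((i + 0.5) * h, nu, theta) for i in range(n)) * h
    tail = theta * (1.0 + t / theta) ** (1.0 - nu) / (nu - 1.0)
    return 4.0 * U ** 2 * r * (inner + t * tail)


super_q = growth_rate_quadrature(50.0, 1.5, 1.0)    # superdiffusive case
super_c = growth_rate_closed_form(50.0, 1.5, 1.0)
normal_q = growth_rate_quadrature(50.0, 3.0, 1.0)   # normal diffusion
normal_c = growth_rate_closed_form(50.0, 3.0, 1.0)
```

In the case $\nu_F = 3$ the rate is already close to its limiting value $4U^2 r\,\theta_F^2/[(\nu_F - 1)(\nu_F - 2)]$ at $t = 50\,\theta_F$, while for $\nu_F = 1.5$ it is still growing like $t^{1/2}$.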


4.5  An  exercise  for  the  diligent  student 

Consider an asymmetric two-state model

$$\mathcal{L}_t + \mathcal{L}_a + U_L\mathcal{L}_x + \alpha_L(a)\mathcal{L} = 0, \qquad \mathcal{R}_t + \mathcal{R}_a + U_R\mathcal{R}_x + \alpha_R(a)\mathcal{R} = 0, \tag{80}$$

with the boundary conditions

$$\mathcal{L}(0,t,x) = \int_0^\infty \alpha_R(a)\,\mathcal{R}(a,t,x)\,da, \qquad \mathcal{R}(0,t,x) = \int_0^\infty \alpha_L(a)\,\mathcal{L}(a,t,x)\,da. \tag{81}$$

Show that the average velocity is

$$\bar U = \frac{\tau_L U_L + \tau_R U_R}{\tau_L + \tau_R}, \qquad \tau_{L,R} \equiv \int_0^\infty \Psi_{L,R}(a)\,da, \tag{82}$$

where $\Psi_L$ and $\Psi_R$ are defined by analogy with (41). Show that the Laplace transform of the velocity autocorrelation function is given by


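$$\mathrm{corr}(s) = U_{RMS}^2\left[\frac{1}{s} - \frac{\tau_L + \tau_R}{\tau_L\tau_R}\,\frac{(1 - \hat{\mathcal{P}}_L)(1 - \hat{\mathcal{P}}_R)}{s^2\,(1 - \hat{\mathcal{P}}_L\hat{\mathcal{P}}_R)}\right], \tag{83}$$

where

$$U_{RMS}^2 \equiv \frac{\tau_L\tau_R}{(\tau_L + \tau_R)^2}\,(U_R - U_L)^2, \tag{84}$$

and $\hat{\mathcal{P}}_{L,R}(s)$ is the Laplace transform of the lifetime PDF $\mathcal{P}_{L,R} = \alpha_{L,R}\Psi_{L,R}$.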


(If you use the moment method, you will need Laplace transforms to solve the integral equation which arises at $n = 1$.) Using the model

$$\alpha_{R,L}(a) = \frac{\nu_{R,L}}{\theta_{R,L} + a}, \tag{85}$$

perform an asymptotic analysis of (83) to identify the anomalous diffusion exponents which occur if either or both of $\nu_L$ and $\nu_R$ are less than 2.

A Solution of an initial value problem

In this appendix we discuss the issue raised at the end of section 3.1 and analyze a problem in which the death rate of old items is so small that the average lifetime $\tau$ is infinite. For example, this is the case $\nu < 1$ in (42). Specifically, consider the initial value problem posed by (37) with the initial and boundary conditions

$$f(a, 0) = N\delta(a), \qquad f(0,t) = r(t). \tag{86}$$

In (86) the replacement rate $r(t)$ is determined by requiring that

$$N = \int_0^\infty f(a,t)\,da. \tag{87}$$

The solution of (37) and (86) is

$$f = N\,\Psi(t)\,\delta(a - t) + \Psi(a)\,r(t - a). \tag{88}$$

The first term on the right hand side of the equation above is the cohort of initial items aging and dying. The second term is the influx of new items. Imposing (87) on (88), we obtain an integral equation for $r$:

$$N = N\Psi(t) + \int_0^t \Psi(a)\,r(t - a)\,da. \tag{89}$$

The integral relation above is known as the renewal equation. Because of the convolution in (89), the Laplace transform

$$\left[\hat\Psi(s), \hat r(s)\right] \equiv \int_0^\infty e^{-sa}\left[\Psi(a), r(a)\right]da, \tag{90}$$

is gratifying. In this way we find from (89) that

$$\hat r = N\,\frac{1 - s\hat\Psi}{s\hat\Psi}. \tag{91}$$

The large-time behaviour of $r(t)$ can be obtained from (91) using standard asymptotic methods.
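
The renewal equation can also be solved directly by marching forward in time, which gives an independent check on the asymptotics. The sketch below is one simple possibility and not a method taken from these notes: it assumes the equation has the form $N = N\Psi(t) + \int_0^t \Psi(a)\,r(t - a)\,da$ and discretizes the convolution with the midpoint rule, treating $r$ as constant on each time step. The function name and all numerical values are illustrative.

```python
import math


def solve_renewal(psi, N, t_end, n_steps):
    """Solve N = N*psi(t) + int_0^t psi(a) r(t - a) da for r on a uniform grid.

    Returns (times, r), where r[k] approximates r at the midpoint times[k] = (k + 1/2) h.
    """
    h = t_end / n_steps
    times = [(k + 0.5) * h for k in range(n_steps)]
    r = []
    for k in range(1, n_steps + 1):
        t_k = k * h
        # Contribution of the already-known values of r to the convolution at t_k.
        known = sum(r[j] * psi(t_k - times[j]) for j in range(k - 1)) * h
        r.append((N * (1.0 - psi(t_k)) - known) / (h * psi(0.5 * h)))
    return times, r


# Markovian check: Psi = exp(-a) should give a constant replacement rate r = N.
times_m, r_m = solve_renewal(lambda a: math.exp(-a), 100.0, 5.0, 500)

# Slowly decaying death rate, nu = 1/2 and theta = 1 in (42): r(t) should decay.
times_p, r_p = solve_renewal(lambda a: (1.0 + a) ** -0.5, 1.0, 50.0, 1000)
asymptote = 1.0 / (math.pi * math.sqrt(times_p[-1]))   # N t^(nu-1) / (pi) for nu = 1/2
```

For $\nu = 1/2$ and $\theta = 1$ the predicted large-time rate is $N/(\pi\sqrt{t})$, and the numerical solution should approach it from below as $t$ increases. The cost of this scheme grows like the square of the number of steps, so it is only practical for modest integration times.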

If $\alpha(a) \propto 1/a$ as $a \to \infty$, then the rightmost singularity of $\hat\Psi(s)$ in the complex $s$-plane is the branch-point at $s = 0$. We show below in (94) through (97) that the structure of $\hat\Psi$ at this branch-point is

$$\hat\Psi(s) = \psi_0 s^{\nu-1} + \tau + \cdots. \tag{92}$$

If $\nu < 1$ then the singular term involving $s^{\nu-1}$ dominates the constant $\tau$ as $s \to 0$. In this case, from (91),

$$\hat r(s) \sim \frac{N}{\psi_0 s^{\nu}} \quad\Rightarrow\quad r(t) \sim \frac{N\,t^{\nu-1}}{\psi_0\,\Gamma(\nu)}, \qquad \text{as } t \to \infty. \tag{93}$$

Because $\nu < 1$ the replacement rate vanishes as $t \to \infty$.

To explain the small-$s$ expansion in (92), we use the model death-rate in (42), which produces the survival function

$$\Psi(a) = \left(\frac{\theta}{\theta + a}\right)^{\nu}. \tag{94}$$

The Laplace transform in (90) is then

$$\hat\Psi(s) = \theta^{\nu}s^{\nu-1}e^{\theta s}\,\Gamma(1 - \nu, \theta s), \tag{95}$$

where $\Gamma(a, x)$ is the incomplete $\Gamma$-function defined by Abramowitz & Stegun in their article 6.5.3. This Laplace transform can be rewritten as

$$\hat\Psi(s) = \theta^{\nu}s^{\nu-1}e^{\theta s}\,\Gamma(1 - \nu) - \theta\,\Gamma(1 - \nu)\sum_{n=0}^{\infty}\frac{(\theta s)^n}{\Gamma(n + 2 - \nu)}. \tag{96}$$

The form above is convenient because the singular terms containing $s^{\nu-1}$ are localized in the first function on the right hand side. When $s \ll 1$ the expansion of (96) is

$$\hat\Psi(s) = \theta^{\nu}\Gamma(1 - \nu)s^{\nu-1} + \frac{\theta}{\nu - 1} + O(s, s^{\nu}), \tag{97}$$

which is the form assumed in (92).

Lecture 6:

The Batchelor spectrum and tracer cascade

R. T. Pierrehumbert


1  The  advection-diffusion  equation 

In this lecture we consider the problem of determining the spectrum of a passive tracer advected by a velocity field. The evolution of the tracer is described by the equation

$$\partial_t\Theta + \mathbf{u}\cdot\nabla\Theta = \kappa\nabla^2\Theta, \tag{1}$$

where $\kappa$ is the molecular diffusivity, $\mathbf{u}$ the advection velocity field and $\partial_t$ denotes a partial derivative with respect to time. Depending on the particular experimental situation, the tracer $\Theta$ could be temperature or the density of a passive contaminant such as ink. By $\Theta$ being passive, we mean that the dynamics of the velocity does not couple with $\Theta$. Thus equation (1) is truly linear in $\Theta$. We confine our attention to the case where $\mathbf{u}$ is non-divergent. Most of the mathematical details will be carried out in the two dimensional case, though many of the techniques and arguments admit ready generalizations to any number of dimensions. Throughout, we employ Cartesian coordinates $(x, y)$.

Batchelor [1] realized that the general increase of gradients of $\Theta$ accompanying the stirring action of the velocity field, which is a consequence of the quadratic term of (1), can also be thought of as a transfer between different Fourier components of the spectrum of $\Theta$. If both $\mathbf{u}$ and $\Theta$ are written in the form of Fourier integrals, the term $\mathbf{u}\cdot\nabla\Theta$ leads to the generation of new harmonics of $\Theta$ and the growth of ever-increasing wavenumbers. The transfer of tracer variance from low wavenumbers to high wavenumbers is mathematically similar to that hypothesized by Kolmogorov for the velocity variance in a turbulent field.

The Fourier components of the tracer $\theta$, if the flow is spatially unbounded, are defined by

$$\theta(x,y) = \frac{1}{(2\pi)^2} \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} dk\, dl\, \hat{\theta}(k,l)\, e^{ikx + ily}. \qquad (2)$$

In the present context, it is convenient to define a spectrum function for $\theta$ as

$$C(K) = \frac{K}{4\pi^2 V} \int_0^{2\pi} |\hat{\theta}(K,\phi)|^2 \, d\phi, \qquad (3)$$

where $\hat{\theta}(K,\phi)$ are the Fourier components of $\theta$ in polar coordinates $K = \sqrt{k^2 + l^2}$ and $\phi = \arctan(l/k)$.
If the tracer field is isotropic the integral is trivial and $C(K) = K|\hat{\theta}(K)|^2 / 2\pi V$. The
normalization factor $V = \iint_V dx\, dy$ ensures that the spectrum has units of variance of $\theta$ per unit
wavenumber. In fact it follows from Parseval’s equality that the variance of $\theta$ is the integral of $C(K)$
in wavenumber space

$$\langle\theta^2\rangle = \lim_{V\to\infty} \frac{1}{V} \iint_V dx\, dy\, \theta^2(x,y) = \int_0^{+\infty} dK\, C(K). \qquad (4)$$
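The Parseval relation (4) can be checked numerically on a small example. The sketch below is only a one-dimensional, periodic, discrete analogue of the definitions above (an assumption made to keep it short); the names `dft`, `one_sided_spectrum` and the test field are hypothetical and chosen for illustration.

```python
import cmath
import math

def dft(values):
    """Plain O(N^2) discrete Fourier transform (standard library only)."""
    n = len(values)
    return [
        sum(v * cmath.exp(-2j * math.pi * j * k / n) for j, v in enumerate(values))
        for k in range(n)
    ]

def one_sided_spectrum(values):
    """Variance per integer wavenumber K >= 0, folding +K and -K together."""
    n = len(values)
    power = [abs(c) ** 2 / n ** 2 for c in dft(values)]  # discrete Parseval weights
    spectrum = [0.0] * (n // 2 + 1)
    for k, p in enumerate(power):
        spectrum[min(k, n - k)] += p                     # |wavenumber| on the periodic grid
    return spectrum

# Hypothetical test field: a mean value plus two harmonics.
N = 64
theta = [0.5 + math.sin(2 * math.pi * 3 * j / N) + 0.25 * math.cos(2 * math.pi * 7 * j / N)
         for j in range(N)]

mean_square = sum(t * t for t in theta) / N   # left-hand side of the discrete analogue of (4)
C = one_sided_spectrum(theta)                 # sum(C) is the right-hand side
print(mean_square, sum(C))
```

In this discrete setting the sum over the one-sided spectrum plays the role of the integral over $K$, and each harmonic of amplitude $a$ contributes $a^2/2$ at its own wavenumber.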
Following the line of argument of Kolmogorov and Batchelor, we suppose that the diffusivity $\kappa$
is so small as to make the effect of diffusion appreciable only at the large wavenumber end of the
spectrum. The part of the equilibrium range of wavenumbers for which the Fourier components of
$\mathbf{u}$ are independent of viscosity is usually termed the “inertial subrange” and an appropriate term
for the part of the equilibrium range for which the Fourier components of $\theta$ are independent of
molecular diffusion is the “advection subrange”. No actual destruction of tracer variance takes place
at wavenumbers in, or smaller than those in, the advection subrange. All the destruction takes place
at high wavenumbers as a result of the action of molecular diffusion. The total rate of destruction of
variance per unit volume is calculated by integrating the advection-diffusion equation over the whole
domain


$$\partial_t \langle\theta^2\rangle = -2\kappa \langle|\nabla\theta|^2\rangle \equiv -\chi. \qquad (5)$$

This relation shows that the quadratic term in (1) makes no contribution to $\partial_t\langle\theta^2\rangle$. Thus when
one Fourier component of the spectrum of $\theta$ is changed by the interaction between $\theta$ and $\mathbf{u}$, other
Fourier components are changed simultaneously in such a way that the sum of the contributions to
$\langle\theta^2\rangle$ from all Fourier components remains the same. This shows that $\theta$ variance is simply transferred
from small to large wavenumbers in the advection subrange, and $\chi$ is a given constant quantity.
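The variance budget quoted above follows from the advection-diffusion equation in three short steps. The sketch below assumes that boundary contributions vanish (a periodic domain, or fields that decay at infinity), which the text does not state explicitly.

```latex
\begin{aligned}
\partial_t \theta^2 + \mathbf{u}\cdot\nabla\theta^2 &= 2\kappa\,\theta\,\nabla^2\theta
  && \text{(advection-diffusion equation multiplied by } 2\theta\text{)} \\
\partial_t \theta^2 + \nabla\cdot(\mathbf{u}\,\theta^2) &= \kappa\nabla^2\theta^2 - 2\kappa\,|\nabla\theta|^2
  && \text{(using } \nabla\cdot\mathbf{u}=0 \text{ and } 2\theta\nabla^2\theta=\nabla^2\theta^2-2|\nabla\theta|^2\text{)} \\
\partial_t \langle\theta^2\rangle &= -2\kappa\,\langle|\nabla\theta|^2\rangle
  && \text{(domain average; the divergence terms integrate to zero)}
\end{aligned}
```

The advective term appears only as a divergence, which is why stirring redistributes variance among wavenumbers without creating or destroying it.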

If the velocity field $\mathbf{u}$ is characterized by a single time scale $\tau$, it is possible to predict the spectrum
of $\theta$ on dimensional grounds. In fact the spectrum $C(K)$ at a wavenumber $K$ is determined by $\chi$,
the mean rate at which variance is cascaded, the time scale $\tau$, and the local wavenumber $K$. There
is only one combination of these three parameters with dimensions of $\langle\theta^2\rangle$ per unit wavenumber and
this combination gives the functional form of the spectrum,

$$C(K) \sim \chi \tau K^{-1}. \qquad (6)$$


Batchelor noticed that this result is of physical interest for tracers whose diffusivity is much smaller
than the viscosity of the advecting fluid. In this case the velocity field at scales shorter than the
viscous cutoff is extremely smooth and the primary effect of the advection on variations of $\theta$ is a
uniform straining rate of strength $\tau^{-1}$.

A characterization of the spectrum of $\theta$ can be obtained also at scales larger than the viscous
cutoff, if the velocity spectrum has a power law dependence on the wavenumber. The spectrum of
velocity $\mathcal{E}(K)$ is the variance of velocity as a function of wavenumber and is defined in a way analogous
to the spectrum of $\theta$ such that

$$\frac{1}{2}\langle\mathbf{u}\cdot\mathbf{u}\rangle = \int_0^{+\infty} dK\, \mathcal{E}(K). \qquad (7)$$

If $\mathcal{E}(K) \sim K^{-\alpha}$, a time-scale $\tau$ for the velocity field is given by the turnover time at scale $K$,

$$\tau \sim \frac{K^{-1}}{\sqrt{K\mathcal{E}(K)}} \sim K^{-(3-\alpha)/2}. \qquad (8)$$


The eddy turnover time decreases with scale for $\alpha < 3$ and the picture of larger eddies feeding smaller
eddies is appropriate. For $\alpha > 3$ the cascade of energy through different scales cannot be considered
local in wavenumber space and the estimate of $\tau$ must be corrected to include the presence of non-local
effects, but this is beyond the scope of this lecture. Given an estimate for $\tau$, the same dimensional
argument used for the case of a velocity field characterized by only one time scale implies

$$C(K) \sim \chi K^{-(5-\alpha)/2}. \qquad (9)$$

According to Kolmogorov’s arguments on isotropic and homogeneous turbulence, the spectrum of
velocity in the inertial subrange scales as $\mathcal{E}(K) \sim K^{-5/3}$. In this case the tracer spectrum scales as
$C(K) \sim K^{-5/3}$ as well and is known as the Obukhov-Corrsin spectrum [2, 3].
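The exponent bookkeeping behind the dimensional argument is easy to mechanize. The following is a minimal sketch using exact fractions; the function names are hypothetical, and the only inputs are the two scalings stated above (turnover time from the velocity spectrum, and tracer spectrum proportional to the cascade rate times the turnover time divided by $K$).

```python
from fractions import Fraction

def turnover_time_exponent(alpha):
    """Exponent p in tau ~ K**p, from tau ~ K**-1 / sqrt(K * E(K)) with E(K) ~ K**-alpha."""
    return -(3 - alpha) / 2

def tracer_spectrum_exponent(alpha):
    """Exponent s in C(K) ~ K**s, from C(K) ~ chi * tau * K**-1."""
    return turnover_time_exponent(alpha) - 1

kolmogorov = Fraction(5, 3)                    # inertial-subrange velocity spectrum
print(tracer_spectrum_exponent(kolmogorov))    # tracer exponent in the Obukhov-Corrsin range
print(tracer_spectrum_exponent(Fraction(3)))   # alpha = 3: scale-independent turnover time
```

For a velocity spectrum with exponent 3 the turnover time no longer depends on $K$, and the tracer exponent reduces to that of the single-time-scale (Batchelor) case.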
Figure 1: Spectrum of a passive tracer advected by a 3D turbulent field. Wavenumbers are normalized
by the cutoff viscous scale (the scale at which momentum is dissipated).

For tracers advected by a turbulent, three-dimensional velocity field, with diffusivity $\kappa$ much
smaller than the viscosity of the fluid, we expect to see a spectrum $C(K) \sim K^{-5/3}$ at scales larger
than the viscous cutoff, and a spectrum $C(K) \sim K^{-1}$ at smaller scales, as shown in Figure 1. The
scaling breaks off at large wavenumbers at the dissipation cutoff (the scale at which the tracer is
dissipated, here assumed smaller than the scale at which momentum is dissipated) and at small
wavenumbers at the scale of the domain, if the domain is finite, or at the scale of forcing if there is
some forcing feeding variance in $\theta$.

2  The  2-point  correlation  function:  what  can  it  tell  us  about 
the  properties  of  the  flow. 

2.0.1  Definition  and  uses  of  the  2-point  correlation  function 

The aim of the two point correlation function $Z_2(r)$ of a concentration field is to yield information
about the typical variation of the concentration over a distance $r$. Let’s therefore define it as

$$Z_2(r; \ldots) = \langle|\theta(\mathbf{x} + r\hat{\mathbf{n}}) - \theta(\mathbf{x})|^2\rangle \qquad (10)$$

where $\hat{\mathbf{n}}$ is a unit vector in the chosen direction, and the average can be a time, spatial, or ensemble
average. Depending on the average chosen, $Z_2(r; \ldots)$ may also depend on the time $t$ (for spatial or
ensemble average), or the starting position $\mathbf{x}$ (for the time average if the fluid is not homogeneous).
If the flow is anisotropic, the correlation function depends on $\hat{\mathbf{n}}$ as well.

The 2-point correlation function can also be rewritten as

$$Z_2(r, \hat{\mathbf{n}}) = \langle\theta(\mathbf{x})^2\rangle + \langle\theta(\mathbf{x} + r\hat{\mathbf{n}})^2\rangle - 2\langle\theta(\mathbf{x})\theta(\mathbf{x} + r\hat{\mathbf{n}})\rangle = 2\langle\theta^2\rangle - 2\langle\theta(\mathbf{x})\theta(\mathbf{x} + r\hat{\mathbf{n}})\rangle \qquad (11)$$

if the flow is spatially homogeneous. More generally, one could define correlation functions of any
order:

$$Z_q(r) = \langle|\theta(\mathbf{x} + r\hat{\mathbf{n}}) - \theta(\mathbf{x})|^q\rangle. \qquad (12)$$
As $q$ becomes larger, $Z_q$ is increasingly dominated by the most extreme fluctuations (i.e. $\theta(\mathbf{x} + r\hat{\mathbf{n}}) - \theta(\mathbf{x})$
locally very large).

The 2-point correlation function yields precious information about the global spatial structure of
the flow. For instance,

• if the flow is smooth at all points, then the concentration gradient exists everywhere and
$|\nabla\theta| < +\infty$. As a result, for small enough $r$, we get $Z_2(r) \approx \langle|\nabla\theta|^2\rangle r^2$;

• for a flow containing well separated jumps/steps in the concentration field, $Z_2(r) \propto r$;

• for a non-singular fractal flow, with a possible local accumulation of steps, the correlation function
becomes $Z_2(r) \propto r^{\nu}$, where $1 < \nu < 2$. Smaller values of $\nu$ arise from singularities in the
tracer field (places where the tracer value becomes infinite);

• if the flow contains integrable singularities, then $Z_2(r) \propto r^{\nu}$ with $0 < \nu < 1$, whereas
non-integrable singularities have $\nu < 0$;

• for a flow with a white noise spectrum, there is no correlation between any two points, so that
the correlation function is constant, $Z_2(r) \propto r^0$.

To summarize, non-singular flows have 2-point correlation functions given by $Z_2(r) \propto r^{\nu}$ where
$0 < \nu < 2$. There exist no interesting flows with $\nu > 2$ since this would require the gradient of
the tracer to be 0 everywhere.
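The first two cases in the list above (a smooth field and a field with well separated steps) can be reproduced with a few lines of code. This is a minimal sketch on a periodic one-dimensional grid; the helper names and the two test fields are hypothetical, and the exponent is estimated from the ratio of the structure function at separations $2r$ and $r$.

```python
import math

def structure_function(field, r, q=2):
    """Z_q(r) for a periodic 1D field on a uniform grid (r measured in grid points)."""
    n = len(field)
    return sum(abs(field[(j + r) % n] - field[j]) ** q for j in range(n)) / n

def local_exponent(field, r, q=2):
    """Estimate nu in Z_q(r) ~ r**nu from the ratio Z_q(2r) / Z_q(r)."""
    ratio = structure_function(field, 2 * r, q) / structure_function(field, r, q)
    return math.log(ratio) / math.log(2)

N = 1024
smooth = [math.sin(2 * math.pi * j / N) for j in range(N)]   # differentiable everywhere
steps = [1.0 if j < N // 2 else -1.0 for j in range(N)]      # two well separated jumps

nu_smooth = local_exponent(smooth, 1)
nu_steps = local_exponent(steps, 1)
print(nu_smooth, nu_steps)
```

For the step field only the pairs of points that straddle a jump contribute, and their number grows linearly with the separation, which is the origin of the linear scaling.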


2.1 Relation between the 2-point correlation function and the power spectrum


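We saw that the flow variance can be decomposed onto the spectral modes as

$$\langle\theta^2\rangle = \int_0^{\infty} C(K)\, dK \qquad (13)$$

where, say, $C(K) \propto K^{-\alpha}$. In the specific case of 1-D, one can show that the correlation function can
also be rewritten as [4, p. 95]

$$Z_2(r) = 2\int_0^{\infty} C(K)\,(1 - \cos(Kr))\, dK \qquad (14)$$

so that

$$\begin{aligned}
Z_2(r) &\approx 2\int_0^{1/r} C(K) K^2 r^2\, dK + 2\int_{1/r}^{\infty} C(K)\,(1 - \cos(Kr))\, dK \\
&\approx 2\int_0^{1/r} C(K) K^2 r^2\, dK + 2\int_{1/r}^{\infty} C(K)\, dK \\
&\approx 2r^2\left[\frac{K^{3-\alpha}}{3-\alpha}\right]_0^{1/r} + 2\left[\frac{K^{1-\alpha}}{1-\alpha}\right]_{1/r}^{\infty}.
\end{aligned} \qquad (15)$$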

Note that only the tail of $C(K)$ varies like $K^{-\alpha}$, so that the 0 bound of the integral does not actually
pose any problems, and is a given constant, say $C_0$. Various cases can occur.

• in the case where $\alpha > 3$ then the integral is dominated by the first term, and so
$Z_2(r) \approx \frac{4}{(3-\alpha)(\alpha-1)}\, r^{\alpha-1} + 2r^2 C_0$. For small $r$ the dominant term is therefore $Z_2(r) \propto r^2$.

• in the case $1 < \alpha < 3$ then the integral is

$$Z_2(r) \approx 2r^2\,\frac{r^{\alpha-3}}{3-\alpha} + 2r^2 C_0 + 2\,\frac{r^{\alpha-1}}{\alpha-1} = \frac{4\, r^{\alpha-1}}{(3-\alpha)(\alpha-1)} + 2r^2 C_0. \qquad (16)$$

For very small $r$, the dominant term is $Z_2(r) \propto r^{\alpha-1}$.

• in the case where $\alpha < 1$ the second integral is not convergent; $Z_2(r)$ is not defined in that case.
However, in the limit $\alpha \to 1$ and $\alpha > 1$, the dominant terms are

$$Z_2(r) \approx \frac{4}{(3-\alpha)(\alpha-1)} + \frac{4}{3-\alpha}\ln r \propto \ln\left(\frac{r}{r_*}\right) \qquad (17)$$

so that the length-scale $r_*$ appears as a cutoff below which the flow is smooth on all scales. The
limit $\alpha \to 1$ corresponds to the Batchelor spectrum, and $r_*$ is the dissipation scale.
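The logarithm in the limit $\alpha \to 1$ comes from expanding the power law about $\alpha = 1$. The sketch below spells this out for a generic order-one prefactor $c$ (a symbol introduced here for illustration); it assumes $|(\alpha-1)\ln r| \ll 1$.

```latex
\begin{aligned}
r^{\alpha-1} &= e^{(\alpha-1)\ln r} \approx 1 + (\alpha-1)\ln r, \\
\frac{c\, r^{\alpha-1}}{(3-\alpha)(\alpha-1)}
  &\approx \frac{c}{3-\alpha}\left[\frac{1}{\alpha-1} + \ln r\right]
   = \frac{c}{3-\alpha}\,\ln\frac{r}{r_*},
   \qquad \ln r_* = -\frac{1}{\alpha-1}.
\end{aligned}
```

In this reading the cutoff $r_*$ is simply the scale at which the constant and the logarithmic terms cancel.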

Note that in the case where $\alpha$ is greater than 3, the correlation function increases faster than $r^2$,
which cannot correspond to any physical flow. We also see that provided $1 < \alpha < 3$ the following
simple scaling argument applies: $Z_2(r)$ is related to the variance on a scale of $r$, so

$$Z_2(r) \approx C(K)\,\delta K \approx K^{-\alpha} K \quad \text{where } K = 1/r. \qquad (18)$$
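The scaling $Z_2(r) \propto r^{\alpha-1}$ can be verified numerically in a case where a closed form is available. The sketch below replaces the integral over $K$ by a sum over integer wavenumbers (an assumption made so that the result can be compared with the classical Fourier series $\sum_{k\ge1}\cos(kr)/k^2 = \pi^2/6 - \pi r/2 + r^2/4$ for $0 \le r \le 2\pi$); the function names and the truncation `k_max` are hypothetical.

```python
import math

def structure_function_from_spectrum(r, alpha, k_max=100000):
    """Z_2(r) = 2 * sum_K C(K) (1 - cos(K r)) with C(K) = K**-alpha, K = 1, ..., k_max."""
    return 2.0 * sum((1.0 - math.cos(k * r)) * k ** (-alpha) for k in range(1, k_max + 1))

def exponent(r, alpha):
    """Local slope d ln Z_2 / d ln r, estimated between r and 2r."""
    z1 = structure_function_from_spectrum(r, alpha)
    z2 = structure_function_from_spectrum(2 * r, alpha)
    return math.log(z2 / z1) / math.log(2)

alpha = 2.0
r = 0.01
z_numeric = structure_function_from_spectrum(r, alpha)
z_exact = 2.0 * (math.pi * r / 2 - r * r / 4)   # closed form of the infinite sum for alpha = 2
slope = exponent(r, alpha)
print(z_numeric, z_exact, slope)
```

With a spectral exponent of 2 the expected structure-function exponent is 1, and the estimated slope approaches that value as the separation decreases and the truncation wavenumber increases.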


3 Determination of $Z_2(r)$ from the mixing properties of the flow


3.1  The  non-diffusive  case 


In the case where the equation governing the spread of concentration contains no source terms, or
diffusive terms, we have

$$\frac{D\theta}{Dt} = 0 \qquad (19)$$

so in order to know $\theta(\mathbf{x}, t)$ we only need to track the trajectories back in time to the initial conditions.
The correlation between the concentrations of 2 points $\mathbf{x}$ and $\mathbf{x}'$ in the fluid is equal to the correlation
of the concentration of $\mathbf{x}_0$ and $\mathbf{x}_0'$ in the initial conditions, where $\mathbf{x}_0$ and $\mathbf{x}_0'$ are the initial positions
of $\mathbf{x}$ and $\mathbf{x}'$. However, we also know from the stretching properties of the fluid that any 2 points grow
exponentially further apart as time evolves (either forwards or backwards). If $\mathbf{x}$ and $\mathbf{x}'$ are separated
by $r$ at a time $t$, $\mathbf{x}_0$ and $\mathbf{x}_0'$ were separated, on average, by $r_0 = r\exp(\lambda t)$ at $t_0 = 0$, where $\lambda$ is
the finite time Lyapunov exponent of the flow. If the initial conditions of the flow have a typical
correlation length-scale $L$ (for instance, the length-scale of the initial forcing in a decaying turbulence
experiment), then there will be 2 regimes:

• either $r\exp(\lambda t) < L$, then the particles have remained in the same “eddy”, within the same
correlation length-scale, so that we simply have

$$\Delta_r\theta \approx |\nabla\theta|_0\, r \exp(\lambda t) \qquad (20)$$

where $|\nabla\theta|_0$ is the typical gradient of the eddy at $t = 0$.

• on the other hand, if $r\exp(\lambda t) > L$ the points are uncorrelated and so $\langle|\Delta_r\theta|^2\rangle = 2\langle\theta^2\rangle|_0$,
where $\langle\theta^2\rangle|_0$ is the variance of the initial flow.

The resulting profile is shown in Fig. 2; as one can see, there seems to be no logarithmic profile
appearing, which would have been the sign of the Batchelor regime. This is typical of the initial
value problem, where the flow keeps a memory of the initial condition for some period of time before
reaching a random flow.
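The two regimes described above can be combined into a single piecewise estimate. The following is only a sketch under the stated assumptions (exponential separation at a single Lyapunov rate, a sharp switch at the correlation length); the function name and all parameter values are hypothetical.

```python
import math

def z2_unforced(r, t, lyapunov, length_scale, grad0, variance0):
    """Two-regime estimate of Z_2(r) at time t in the unforced, non-diffusive case."""
    r0 = r * math.exp(lyapunov * t)      # separation of the two points at t = 0
    if r0 < length_scale:
        return (grad0 * r0) ** 2         # same initial eddy: square of the estimate (20)
    return 2.0 * variance0               # initial values are uncorrelated

# Hypothetical parameter values, chosen only for illustration.
params = dict(lyapunov=1.0, length_scale=1.0, grad0=1.0, variance0=0.5)
t = 2.0
crossover = params["length_scale"] * math.exp(-params["lyapunov"] * t)  # r for which r0 = L
print(z2_unforced(0.5 * crossover, t, **params), z2_unforced(2.0 * crossover, t, **params))
```

The crossover separation shrinks exponentially in time, so the quadratic range is progressively squeezed toward small $r$, with no logarithmic range in between.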
Figure 2: Two-point correlation function in the unforced, non-diffusive case.

3.2  The  forced  case 

This time we have

$$\frac{D\theta}{Dt} = \kappa\nabla^2\theta + S(\mathbf{x}, t) \qquad (21)$$

where $S(\mathbf{x}, t)$ is a source term, with a finite correlation length-scale, but no time correlation, i.e.

$$\langle S(\mathbf{x}, t)\, S(\mathbf{x}', t')\rangle = S_0(|\mathbf{x} - \mathbf{x}'|)\,\delta(t - t') \qquad (22)$$

where $S_0(|\mathbf{x} - \mathbf{x}'|)$ decays on a lengthscale $L$. One can distinguish 2 regimes. In the initial phase the
flow is mixed without dissipation; since the forcing is uncorrelated in time, the system behaves like a
random walk process, so that

$$\langle\theta^2\rangle \propto S_0 t \qquad (23)$$

Later on, the flow reaches a steady state with a balance between the forcing and the mixing/dissipation
terms. The dissipation scale $l_*$ for which the strain balances the dissipation is $l_* = \sqrt{\kappa/\lambda}$ (cf. Bill’s
lecture). The time needed for the flow to create structures on the dissipation scale is $t_* = \frac{1}{\lambda}\ln(L/l_*)$.
This time-scale can be seen as the memory of the system. Trajectories with small Lyapunov exponent
have a large memory: the flow remains correlated for longer times. The 2-point correlation function
will mostly depend on scales of order of $L$. Indeed, if 2 points are separated by $r < L$, they must
come from regions which have the same source term, so that the quantity $Z_2(r)$ is likely to be small.
On the other hand, if $r > L$ then the correlation function of the concentration field will be similar to
that of the source terms. As before, let’s consider 2 points at $\mathbf{x}$ and $\mathbf{x}'$ at a time $t$ and trace their
trajectories backward in time. Assuming that the statistics of the forced flow are stationary, the flow
builds up correlation when the separation of the 2 points is larger than $L$, but loses correlation for
times larger than the dissipation time-scale. The 2-point correlation function will therefore depend
on the difference $t_L - t_*$, where $t_L$ is the amount of time necessary to reach scales of order $L$ starting
from an initial separation $r$: $t_L = \frac{1}{\lambda}\ln(L/r)$. This is illustrated in Fig. 3.

Hence 

$$Z_2(r) \propto t_L - t_* \propto \frac{1}{\lambda}\ln(r/l_*) \qquad (24)$$

This logarithmic dependence in $r$ of the correlation function shows that the forced case is consistent
with a Batchelor regime. For more details on this subject, see Refs. [5] and [6].
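The time scales entering this argument are simple to evaluate. The sketch below computes the dissipation scale, the time to reach it from the forcing scale, and the time to separate from $r$ to the forcing scale, and confirms that the two times differ by $\frac{1}{\lambda}\ln(r/l_*)$ in magnitude; the function name and the parameter values are hypothetical, in arbitrary units.

```python
import math

def forced_time_scales(r, kappa, lyapunov, forcing_scale):
    """Return (l_star, t_star, t_L) for the forced problem."""
    l_star = math.sqrt(kappa / lyapunov)                    # dissipation scale
    t_star = math.log(forcing_scale / l_star) / lyapunov    # time to create scales l_star from L
    t_l = math.log(forcing_scale / r) / lyapunov            # time to separate from r to L
    return l_star, t_star, t_l

# Hypothetical values, for illustration only.
kappa, lyapunov, forcing_scale = 1.0e-6, 1.0, 1.0
for r in (1.0e-2, 1.0e-1):
    l_star, t_star, t_l = forced_time_scales(r, kappa, lyapunov, forcing_scale)
    # t_star - t_l is positive for l_star < r < L and grows logarithmically with r.
    print(r, t_star - t_l, math.log(r / l_star) / lyapunov)
```

Each decade in $r$ adds the same increment $\ln(10)/\lambda$ to the time difference, which is the logarithmic dependence referred to in the text.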
Figure 3: Evolution of the separation of 2 points, and relevant time-scales.


References 

[1] G. K. Batchelor, “Small-scale variation of convected quantities like temperature in turbulent fluid.
Part 1. General discussion and the case of small conductivity,” J. Fluid Mech. 5, 113 (1959).

[2] A. M. Obukhov, “Structure of the temperature field in turbulent flow,” Izv. Akad. Nauk. Geogr.
Geofiz. 13, 58 (1949).

[3] S. Corrsin, “On the spectrum of isotropic temperature fluctuations in an isotropic turbulence,” J.
Appl. Phys. 22, 469 (1951).

[4] A. S. Monin and A. M. Yaglom, Statistical Fluid Mechanics (MIT Press, Cambridge, MA, 1981).

[5] R. T. Pierrehumbert, “Lattice models of advection-diffusion,” Chaos 10, 61 (2000).

[6] M. Chertkov, G. Falkovich, I. Kolokolov and V. Lebedev, “Statistics of a passive scalar advected by a
large-scale two-dimensional velocity field: analytic solution,” Phys. Rev. E 51, 5609 (1995).




On  Stratified  Kolmogorov  Flow 

Yuan-nan  Young 


Abstract 

In  this  study  we  investigate  the  stability  of  the  weakly  stratified  Kolmogorov  shear 
flow.  We  derive  the  amplitude  equations  for  this  system  and  solve  them  numerically  to 
explore  the  effect  of  weak  stabilizing  stratification.  We  then  explore  the  non-diffusive 
limit  of  this  system  and  derive  amplitude  equations  in  this  limit. 


1  Introduction 

The Kolmogorov flow - a two-dimensional viscous sinusoidal flow induced by a unidirectional
external force field - has been studied in the context of generation of large scale turbulence in
two dimensions. Various aspects of the Kolmogorov flow, such as the generation of 2D turbulence [1],
vortex merging [2], and the role of negative viscosity in large scale formation in
2D turbulence [3], have been widely applied to geophysical [4] and laboratory systems [5, 6].

In this study we impose a weak, stabilizing temperature gradient and investigate the temperature
evolution associated with the flow instability. We first adopt Sivashinsky’s approach
and derive the finite-amplitude equation for the case of infinite domain (periodic boundary
conditions) and finite Peclet numbers. We then solve the amplitude equations (both 1D and
2D) numerically and investigate the buoyancy effect on the structure formation of the flow.
We also investigate large Peclet number cases, where the critical layers in the scalar field
play a key role in the flow instability and dynamics. Finally, we compare
fully numerical simulations with results from the weakly nonlinear analyses.


2  Formulation  and  linear  analysis 

2.1  Formulation 

The  Kolmogorov  shear  flow  is  more  generally  defined  as  a  sinusoidal  shear  flow,  whether  the 
fluid  is  viscous  or  inviscid.  In  our  2-D  formulation  of  the  problem,  where  the  incompressible 
flow  can  be  written  as  a  stream  function,  we  couple  this  background  shear  flow  to  a  stabilizing 
temperature.  Without  loss  of  generality,  we  write  the  total  background  state  as  a  sinusoidal 
stream  function  and  a  linear  temperature  profile: 

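$$\psi_0 = U_0 l \cos(z/l), \qquad T_0 = \frac{\Delta T}{l} z, \qquad (1)$$

where $U_0$ is the amplitude of the background shear flow, $l$ is the periodicity of the shear flow,
and $\Delta T$ is the temperature difference across distance $l$. Denoting $\psi$ as the stream function
disturbance and $\theta$ as the temperature disturbance, we first write the momentum equation
and the advection diffusion equation as follows:

$$\partial_t\nabla^2\psi + \partial_z\psi_0\,\partial_x\nabla^2\psi - \partial_z^3\psi_0\,\partial_x\psi - J(\psi, \nabla^2\psi) = \nu\nabla^4\psi - g\alpha\,\partial_x\theta, \qquad (2)$$

$$\partial_t\theta + \partial_z\psi_0\,\partial_x\theta - \partial_x\psi\,\partial_z T_0 - J(\psi, \theta) = \kappa\nabla^2\theta. \qquad (3)$$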

We nondimensionalize the above equations such that the background shear flow $u_0 = -\sin z$
and the background stabilizing thermal gradient is equal to 1 ($Re = U_0 l/\nu$ is the Reynolds
number, $Pe = U_0 l/\kappa$ is the Peclet number, and $Ri = g\alpha\Delta T\, l/U_0^2$ is the Richardson number),
and equations 2 and 3 thus read:

$$\partial_t\nabla^2\psi - \sin z\,(\nabla^2\psi + \psi)_x - J(\psi, \nabla^2\psi) = \frac{1}{Re}\nabla^4\psi - Ri\,\theta_x, \qquad (4)$$

$$\partial_t\theta - (\sin z\,\theta + \psi)_x - J(\psi, \theta) = \frac{1}{Pe}\nabla^2\theta. \qquad (5)$$


In the following subsections we first present results from the usual linear analysis on cases
where the periodicity of the shear flow is the same as the domain considered (integer
periodicity). We then consider cases where the perturbations are products of periodic and
exponential functions (Floquet system) and may exhibit parametric resonance.

2.2  Linear  analysis  on  the  stratified  Kolmogorov  flow 

In this section we present results of linear analysis on equations 4 and 5 for shear flow of
periodicity the same as the domain size. The linearized version of equations 4 and 5 reads

$$\partial_t\nabla^2\psi - \sin z\,\partial_x(\nabla^2\psi + \psi) = \frac{1}{Re}\nabla^4\psi - Ri\,\theta_x, \qquad (6)$$

$$\partial_t\theta - (\sin z\,\theta + \psi)_x = \frac{1}{Pe}\nabla^2\theta. \qquad (7)$$

Without the stabilizing temperature, the non-stratified Kolmogorov shear flow is known to be
unstable to long wavelength perturbations for Reynolds numbers $Re > \sqrt{2}$: the critical wave
number $k_c = 0$ and the critical Reynolds number $Re_c = \sqrt{2}$. Also for small horizontal wave
numbers ($k \ll 1$) the growth rate A can be obtained via the following dispersion relation:

A  =  (1  —  — )fc2  +  Re2(l  +  ~^~)k4  +  0(ke).  (8) 

We numerically solve the above linearized equations with periodic boundary conditions
(in both the horizontal and the vertical directions). Figure 1 shows marginal curves for weak
stratification (see caption for the corresponding stratification strength for each curve) and
figure 2 shows the critical Reynolds numbers and wave numbers as functions of Richardson
numbers for Prandtl number $Pr = \nu/\kappa = 10$. As shown in the figures, the critical wave
number $k_c$ increases rapidly as we increase the Richardson number. As the Richardson
number increases above $10^{-5}$, the critical wave number $k_c$ increases significantly from 0
towards some finite value (about 0.1). This also implies that the inverse cascade observed in the
nonstratified Kolmogorov shear flow is at risk, namely, the large scale perturbation now has
been stabilized by the existence of the stably stratified temperature. As will be shown in the
numerical results, the inverse cascade is indeed prevented by the stabilizing temperature and
we will discuss this in detail via the tool of Lyapunov functional.

Figure 1: Marginal curves for the unbounded stratified Kolmogorov flow for weak stratification
and Prandtl number $\sigma = 1$. Curves are labeled by their Richardson numbers. In Panel
(a), from curve 1 to curve 6, the Richardson number is, respectively, $10^{-7}$, $10^{-6}$, $10^{-5}$, $10^{-4}$,
$10^{-3}$, and $10^{-2}$. In Panel (b), the Richardson number is 0.01, 0.05, 0.1 and 0.15 for curve 1
to curve 4, respectively.

Figure 2: Critical Reynolds number (a) and critical wave number (b) as functions of Richardson
number for Pr = 10 for the periodic case.

2.3 Linear analysis on the stratified Kolmogorov flow: Floquet calculation

We now show results from the Floquet calculation for the stratified Kolmogorov shear flow.
We perturb the system with perturbations of the form $(\psi, \theta) \to e^{ikx + iqz}\,(\psi(z,t), \theta(z,t))$, where $0 < q < 0.5$
is the Floquet multiplier (Bloch number) and $k$ is the horizontal wave number. With the
definition of $\nabla'^2$ in equation 9,

$$\nabla'^2\psi = -(k^2 + q^2)\psi + 2iq\,\partial_z\psi + \partial_z^2\psi, \qquad (9)$$

equations 6 and 7 then take the following form:

$$\partial_t\nabla'^2\psi - ik\sin z\,(\nabla'^2\psi + \psi) = \frac{1}{Re}\nabla'^4\psi - ik\,Ri\,\theta, \qquad (10)$$

$$\partial_t\theta - ik(\sin z\,\theta + \psi) = \frac{1}{Pe}\nabla'^2\theta. \qquad (11)$$
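The modified Laplacian can be recovered by applying the ordinary Laplacian to a Floquet-type perturbation. The sketch below assumes the perturbation has the form $e^{ikx+iqz}\psi(z,t)$, with $\psi$ periodic in $z$; this form is inferred from the definition of $\nabla'^2$ rather than quoted from the text.

```latex
\begin{aligned}
\nabla^2\!\left[e^{ikx+iqz}\,\psi(z,t)\right]
  &= e^{ikx+iqz}\left[(ik)^2 + (\partial_z + iq)^2\right]\psi \\
  &= e^{ikx+iqz}\left[-(k^2+q^2) + 2iq\,\partial_z + \partial_z^2\right]\psi
   = e^{ikx+iqz}\,\nabla'^2\psi, \\
\partial_x\!\left[e^{ikx+iqz}\,\psi(z,t)\right] &= ik\,e^{ikx+iqz}\,\psi(z,t).
\end{aligned}
```

Under this assumption every $x$-derivative in the linearized equations becomes a factor $ik$ and every Laplacian becomes $\nabla'^2$, after which the common exponential factor cancels.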

Solving equations 10 and 11 numerically with periodic boundary conditions, we obtain the
parametric marginal curves for various values of $q$. Figure 3 shows the critical Reynolds
number as a function of $q$ for the non-stratified ($Ri = 0$) case. We first note that $Re_c(q) >
Re_c(q = 0)$, and at around $q = 0.35$ the minimum moves from one branch to the other, which produces
a cusp-like transition at $q = 0.35$. From figure 3 we also note that the most unstable mode has
the same periodicity as the background shear flow. Therefore, we do not need to perform
the same Floquet calculation for the stratified case.

Figure 3: Critical Reynolds number (a) and critical wave number (b) as functions of the
Floquet multiplier $q$ for the non-stratified Kolmogorov shear flow.

Having shown that perturbations of
the same periodicity are the most unstable, we then proceed to uncover the effect of weak
stratification on the nonlinear behavior of the flow. By this we mean in particular that we
are going to perform an amplitude expansion around the $k = 0$ mode for small $Ri$. From the
linear analysis, we have observed that the critical wave number $k_c$ increases from zero (for no
stratification, $Ri = 0$) to a finite value (for strong stratification). As $k_c$ transitions from 0 to
finite values, the amplitude equation changes from a Cahn-Hilliard like equation [4] for long
wavelength instability to a Ginzburg-Landau equation for finite wavelength instability. In
our weakly nonlinear analysis, we focus on the weak stratification limit where the system still
inherits the instability to long wavelength perturbations. To have buoyancy ($\theta$) appear at the
desired order in the amplitude equation, we rescale $\theta$ and take $Ri$ to be small, such that
$Ri = \epsilon^6 F_e$ and $b = Ri\,\theta/\epsilon^5 = \epsilon F_e\theta$. Equations 4 and 5, in the new scaling, take the following
form:

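$$\epsilon^4\partial_\tau\nabla^2\psi - \epsilon J_\xi(\psi, \nabla^2\psi) - \epsilon\sin z\,(\nabla^2\psi + \psi)_\xi = \frac{1}{Re}(1 - \epsilon^2)\nabla^4\psi - \epsilon^6 b_\xi, \qquad (12)$$

$$\epsilon^4\partial_\tau b - \epsilon J_\xi(\psi, b) - \epsilon^2 F_e\psi_\xi - \epsilon\sin z\, b_\xi = \frac{1}{Pe}\nabla^2 b, \qquad (13)$$

where $J_\xi$ is the usual Jacobian with respect to $\xi$ and $z$.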

3 Weakly nonlinear analysis: Pe ~ O(1)

In this section we first construct the amplitude equations for the stratified shear flow with
$Pe \sim O(1)$. We remark here that we are mostly interested in two ranges of $Pe$: $Pe \sim O(1)$
and $Pe \gg 1$. The range $Pe \ll 1$ is where molecular diffusivity dominates the dynamics, and
to the first few orders in the expansion, there appears to be no coupling between the temperature
and the flow; this range is thus of no interest in our analysis. In real physical systems (salty water,
for example), the Peclet number for a small Reynolds number of $\sqrt{2}$ is already on the order
of a thousand, and thus cases where $Pe \gg 1$ are of more physical relevance. In subsection
3.3 we present numerical solutions to the amplitude equations for the $Pe \sim O(1)$ case. In the
following section, we derive the amplitude equation for the nondiffusive case ($Pe \gg 1$).
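The order-of-magnitude statement about salty water can be checked from the definitions, since the Peclet number is the Reynolds number times the ratio of viscosity to diffusivity. The property values in the sketch below are assumed typical magnitudes for water and dissolved salt, not numbers taken from the text, and the function name is hypothetical.

```python
import math

def peclet_from_reynolds(reynolds, viscosity, diffusivity):
    """Pe = U0 l / kappa = Re * (nu / kappa)."""
    return reynolds * viscosity / diffusivity

# Assumed order-of-magnitude properties of salty water (illustrative values only):
nu_water = 1.0e-6      # kinematic viscosity, m^2/s
kappa_salt = 1.5e-9    # molecular diffusivity of salt, m^2/s

pe = peclet_from_reynolds(math.sqrt(2.0), nu_water, kappa_salt)
print(round(pe))
```

With these assumed values the Peclet number at the critical Reynolds number comes out close to a thousand, consistent with the estimate quoted above.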

3.1 Construction of the amplitude equations: Pe ~ O(1)

Adopting the scaling discussed in section 2.2, and expanding $\psi$ and $b$ as follows

$$\psi = \psi_0 + \epsilon\psi_1 + \epsilon^2\psi_2 + \cdots, \qquad (14)$$

$$b = b_0 + \epsilon b_1 + \epsilon^2 b_2 + \epsilon^3 b_3 + \cdots, \qquad (15)$$

we substitute the above expansions into equations 12 and 13. Collecting terms order by order,
$(\psi_i, b_i)$ that satisfy the periodic boundary conditions (for the “fast variable” $z$) are obtained,
and the solvability condition at each order gives rise to relationships between $\psi_i$ and $b_i$. At
the zeroth order $O(\epsilon^0)$, the equations are:

$$\psi_{0zzzz} = 0, \qquad \frac{1}{Pe} b_{0zz} = 0, \qquad (16)$$

and the periodic solutions are

$$\psi_0 = A(\xi, \tau), \qquad b_0 = B(\xi, \tau). \qquad (17)$$

At the first order $O(\epsilon^1)$, we obtain the following equations

$$\frac{1}{Re}\psi_{1zzzz} = -A_\xi\sin z, \qquad (18)$$

$$\frac{1}{Pe} b_{1zz} = -B_\xi\sin z, \qquad (19)$$

and the periodic solutions

$$\psi_1 = -Re\,A_\xi\sin z + A_1(\xi, \tau), \qquad b_1 = Pe\,B_\xi\sin z + B_1(\xi, \tau). \qquad (20)$$

At the second order $O(\epsilon^2)$, the equations are as follows:

$$\frac{1}{Re}\psi_{2zzzz} = -A_{1\xi}\sin z - Re\,A_\xi^2\cos z, \qquad (21)$$

$$\frac{1}{Pe} b_{2zz} = -B_{1\xi}\sin z - (Re + Pe)A_\xi B_\xi\cos z - \left(\frac{Pe}{2} + \frac{1}{Pe}\right)B_{\xi\xi} - F_e A_\xi + \frac{Pe}{2}B_{\xi\xi}\cos 2z. \qquad (22)$$

At this order $O(\epsilon^2)$, the solvability condition for $\psi_2$ gives rise to the critical Reynolds number
$Re = R_0 = \sqrt{2}$. The solvability condition for $b_2$ gives us the following relationship between
$B (= b_0)$ and $A (= \psi_0)$:

$$\left(\frac{Pe}{2} + \frac{1}{Pe}\right)B_{\xi\xi} + F_e A_\xi = 0. \qquad (23)$$
The solutions at this order are:

$$\psi_2 = -R_0 A_{1\xi}\sin z - R_0^2 A_\xi^2\cos z + A_2(\xi, \tau), \qquad (24)$$

$$b_2 = Pe\,B_{1\xi}\sin z + Pe(R_0 + Pe)A_\xi B_\xi\cos z - \frac{Pe^2}{8}B_{\xi\xi}\cos 2z. \qquad (25)$$
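The second-order buoyancy solution can be checked against its governing equation numerically. The sketch below assumes that, once the solvability condition has removed the $z$-independent terms, $b_{2zz}$ must equal $Pe$ times the remaining $\sin z$, $\cos z$ and $\cos 2z$ forcing; the amplitudes are treated as plain numbers with arbitrary (hypothetical) values, and all function names are illustrative.

```python
import math

def forcing_fast_part(z, pe, re, a_xi, b_xi, b1_xi, b_xixi):
    """z-dependent forcing that b_2zz must equal (z-independent terms removed by solvability)."""
    return pe * (-b1_xi * math.sin(z)
                 - (re + pe) * a_xi * b_xi * math.cos(z)
                 + 0.5 * pe * b_xixi * math.cos(2 * z))

def b2(z, pe, re, a_xi, b_xi, b1_xi, b_xixi):
    """Second-order buoyancy correction, with Re set to its critical value in the caller."""
    return (pe * b1_xi * math.sin(z)
            + pe * (re + pe) * a_xi * b_xi * math.cos(z)
            - pe ** 2 / 8.0 * b_xixi * math.cos(2 * z))

def second_derivative(f, z, h=1.0e-3):
    """Centred finite-difference estimate of f''(z)."""
    return (f(z + h) - 2.0 * f(z) + f(z - h)) / h ** 2

# Arbitrary amplitudes, used only to test the algebra.
args = dict(pe=3.0, re=math.sqrt(2.0), a_xi=0.7, b_xi=-0.4, b1_xi=0.3, b_xixi=1.1)
max_error = max(
    abs(second_derivative(lambda s: b2(s, **args), z) - forcing_fast_part(z, **args))
    for z in [0.1 * n for n in range(63)]
)
print(max_error)
```

The residual is limited by the finite-difference truncation error, so a small value over a full period of $z$ indicates that the coefficients of the three harmonics are mutually consistent.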

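Going on to the 3rd order in $\epsilon$, we have the following equation for $\psi_3$:

$$\frac{1}{R_0}\psi_{3zzzz} = \left[R_0^2 A_\xi^3 - 3A_{\xi\xi\xi} - A_\xi - A_{2\xi}\right]\sin z - 2R_0^2 A_{1\xi}A_\xi\cos z, \qquad (26)$$

and the solution $\psi_3$ is easily obtained as follows:

$$\psi_3 = R_0\left[R_0^2 A_\xi^3 - 3A_{\xi\xi\xi} - A_\xi - A_{2\xi}\right]\sin z - 2R_0^3 A_{1\xi}A_\xi\cos z + A_3(\xi, \tau). \qquad (27)$$

The solvability condition at this order gives us the amplitude equation: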

(%)t  +  +  {[Ro  -  =  °-  (28) 

Following Sivashinsky, if we write $\partial_z \to \partial_z + \epsilon^3\partial_\eta$, we obtain identical solutions up to the second
order and obtain the following amplitude equation at third order:

(%)r  =  -  {[Ro  - 

~AvAm  +  AZAtfri  +  +  p^/2  +  l/Pe'4' 

(29) 

We note that the buoyancy amplitude $B$ is completely slaved to the stream function amplitude
$A$ as the effect of the stabilizing temperature gradient is put to higher order ($\epsilon^6$). Writing
$\rho = \xi + c\eta$, we can turn equation 29 into a uni-directional amplitude equation [1] in terms of
$\rho$ and $\tau$ as follows:

(app)t  =  — yapppppp  -  {[Ro - |-A2]Ap}ppp  4-  -~-{Ap)2pp  +  pe/2  pi/Pe"4,  (30) 

where  c  is  the  aspect  ratio  of  the  characteristics  of  the  uni-directional  flow. 

3.2  Lyapunov  functional 

In this subsection we derive a Lyapunov functional for the 1D amplitude equation (equation
28). Following [7], we try to find an energetic functional of amplitude $A$ such that the
evolution of the amplitude can be described by the functional. To be more specific, we seek
a Lyapunov functional $V[A]$ such that

drA  =  drV  =  ~l  (A(^T)rM,  (31) 
which then implies that the system cannot sustain oscillatory motion and has to settle down
to a stationary equilibrium. For the 1D amplitude equation for the weakly stratified case,
it is straightforward to find a functional for the amplitude equation 28 if we rewrite it as
follows:


(%)r  d;  SA  +  pe^2  + 

(32) 

where 

-  A\  -  24<ff. 

(33) 

Putting  Ctf  =  A ,  we  write  down  the  evolution  equation  for  F  as: 

dr  ~  J  Pe/2  +  l/P  eTJ  2  ^ 

(34) 

FePeC2 

This  indicates  that  the  new  functional  G  =  F  +  -pj+f  is  decaying  in  time  and  there  is  a 
stationary solution for arbitrary initial conditions. In the absence of the stabilizing stratification, random perturbations of small scale will reach a stationary solution with the minimum number of nodes within the domain, i.e., the scale of the stationary solution is the size of the domain. This is the essence of the inverse cascade: the amplitude evolves such that the spatial scale increases until it becomes comparable to the size of the computational domain. In the nonstratified case, since the functional is expressed in terms of the gradient of $A$, and since $A$ is periodic in $\xi$, we conclude that the stationary solution $A(\xi)$ should have only one bump inside the domain. However, this inverse cascade is arrested by the presence of stabilizing stratification, as the additional $C$-dependent term included in the functional prevents the inverse cascade process. This will be demonstrated in the following subsection.

3.3  Numerical  solution 

In this section we present numerical results from solving the amplitude equations using a pseudo-spectral code. First we show results for the 1D version of the amplitude equation (equation 28). Figure 4 demonstrates the stabilizing effect of the temperature: the amplitude decreases and the structure tends toward smaller scales as we increase the strength of the stratification. As shown in the previous subsection, we can find a Lyapunov functional for this equation in terms of the gradient of the amplitude; we therefore display the temporal evolution of the amplitude gradient (figure 5). Figure 5 shows the time-space plots of the amplitude gradient without any stratification in (a) and with $F_6 = 0.1$ in (b). We note that the inverse cascade manifested in panel (a) is arrested by the presence of stabilizing stratification in panel (b), in agreement with the conclusion we drew from the Lyapunov functional. The numerical solutions to the uni-directional amplitude equation (equation 30) are displayed in figure 6, where the time evolution is for the amplitudes, not the amplitude gradients. In panel (a) of figure 6, where there is no stratification, we see the chaotic behavior of the flow due to the extra nonlinear term. Yet in panel (b), where $F_6 = 0.1$, we see that the stratification diminishes the chaotic behavior and reduces the flow to a spatially periodic state.
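The report does not list its pseudo-spectral code. As a hedged illustration of the basic building block such a code relies on, the sketch below differentiates a periodic field with the FFT; the function name `spectral_derivative` and the test field are assumptions made here, not taken from the report. In a pseudo-spectral scheme, derivatives like these are evaluated in Fourier space while nonlinear products are formed on the physical-space grid.

```python
import numpy as np

def spectral_derivative(f, length, order=1):
    """Differentiate periodic samples f on [0, length) using the FFT."""
    n = f.size
    # Angular wavenumbers matching NumPy's FFT ordering.
    k = 2.0 * np.pi * np.fft.fftfreq(n, d=length / n)
    return np.real(np.fft.ifft((1j * k) ** order * np.fft.fft(f)))

if __name__ == "__main__":
    n, length = 64, 2.0 * np.pi
    x = np.arange(n) * length / n
    f = np.sin(3.0 * x)
    # Compare against the exact first derivative 3 cos(3x).
    err = np.max(np.abs(spectral_derivative(f, length) - 3.0 * np.cos(3.0 * x)))
    print("max error in first derivative:", err)
```

For a smooth periodic field the error is at the level of round-off, which is why spectral differentiation suits amplitude equations with high-order derivatives on a periodic domain.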
Figure 5: Time-space plots of the amplitude gradient with $F_6 = 0$ (a) and $F_6 = 0.1$ (b).

Figure 6: Time-space plots of the amplitude for the uni-directional flow with $F_6 = 0$ (a) and $F_6 = 0.1$ (b).


In figure 7 we show the solutions ($\psi_0 + A(\xi, \eta, \tau)$) to the 2-D amplitude equation, where the computational domain has been scaled by an aspect ratio $c = 20$ as suggested in [3]. The only difference between these two snapshots of the stream function is the stratification strength. We note that the effect of stratification is manifested not only in the amplitude of the flow but also in the flow patterns: the stronger the stratification, the smaller the scales of the flow patterns.


Figure 7: Stream function (zeroth order) from the 2D amplitude equation for (a) $F_6 = 0.01$ at $t = 15$ and (b) $F_6 = 0.1$.

In figures 8 and 9 we show the horizontal average of the temperature to demonstrate the potential for layer formation in the temperature. Figure 8 is a snapshot of the horizontal average temperature: $T_0(z) + B(\xi, \eta, t)$. We note that in panel (a) of figure 9, layers disappear and re-appear randomly, while in panel (b) layer structures eventually disappear due to the stabilizing stratification, which diminishes the flow.

Figure 8: Snapshot of the horizontal average of the total temperature profile for $F_6 = 0.01$.


Figure 9: Horizontal average of the total temperature profile at various times for (a) $F_6 = 0.01$ and (b) $F_6 = 0.1$.


4 Internal Boundary Layer for large Peclet numbers: $Pe \to \infty$

In  this  section  we  focus  on  the  instability  of  the  stratified  shear  flow  in  the  large  Peclet 
number  limit.  Figure  10  displays  characteristics  of  the  eigenfunctions  of  the  stratified  shear 
flow:  the  stream  function  disturbance  reaches  local  minima  while  the  temperature  peaks  at 
the  inflection  point  of  the  background  shear  flow. 

Figure 10: Eigenfunctions for stratified shear flow. The Prandtl number is $Pr = 10^4$ and $\sin z$ is the background shear flow. The solid line is the stream function disturbance $\psi$ and the dashed line is the temperature disturbance $\theta$.

Figure 11: (a) Internal boundary layer structure as the Prandtl number $Pr$ increases. The Prandtl numbers for the solid, dashed, and dash-dotted lines are, respectively, $10$, $10^3$, and $10^5$. (b) The boundary layer thickness as a function of $Pr$ (note that since the Reynolds number is fixed, the Peclet number is proportional to the Prandtl number). The solid line is the best fit for the last five points, which indicates that the thickness scales as $Pe^{-0.326}$. The Richardson number $Ri$ is fixed at 0.01 and the Reynolds number is $Re = 1.92$.

The above structure reminds one of the no-slip, no-flux boundary layer: velocities vanish at the walls and so does the density flux. This is similar to what we observe from the eigenfunction (for the unbounded case), except that there is a constant background vertical velocity if the horizontal wave number $k$ is not zero. Figure 11 shows the internal boundary layer structure as we vary the Prandtl number. We observe that the internal boundary layer thickness decreases as we decrease the molecular diffusivity. By balancing the advective term (associated with the background shear flow) and the diffusive term, we obtain a naive scaling of the boundary layer thickness ($l$) with the Peclet number $Pe$:

$$l \sim Pe^{-1/3}, \qquad (35)$$

which is in fair agreement with the empirical fit obtained from the numerical solutions to the linearized equations. Also, in this limit of large (or infinite) Peclet number, the scaling and expansion used in the previous section to derive the amplitude equations no longer work inside the "internal boundary layer", as terms of different orders are mixed up. Thus we need to find a new scaling inside the internal boundary layer and perform asymptotic matching across it. We first perform asymptotic matching for infinite Peclet number cases, and then relax the infinite Peclet number limit to the large Peclet number limit ($Pe \sim \epsilon^{-10}$) and derive the dispersion relations and general linear solutions in subsection 4.2.
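The balance behind equation 35 can be written out explicitly. The following is only a sketch, assuming that near the inflection point the background flow $\sin z$ can be approximated by $z$ and that the disturbance has a horizontal wavenumber $k$ of order one (both assumptions are made here for illustration):

```latex
% Near z = 0: advection by the background shear vs. vertical diffusion,
% evaluated at a distance l (the layer thickness) from the inflection point.
\sin z\,\partial_x\theta \;\sim\; \frac{1}{Pe}\,\partial_z^{2}\theta
\quad\Longrightarrow\quad
l\,k\,\theta \;\sim\; \frac{1}{Pe}\,\frac{\theta}{l^{2}}
\quad\Longrightarrow\quad
l^{3} \;\sim\; \frac{1}{k\,Pe},
\qquad
l \;\sim\; Pe^{-1/3}.
```

The exponent $-1/3 \approx -0.333$ is close to the fitted value $-0.326$ quoted in the caption of figure 11.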

4.1  Scaling  and  asymptotic  matching  for  the  internal  boundary  layer 

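We first focus on the linearized version of equation 13 and set the right-hand side to zero:

$$\epsilon^{3}\,\partial_\tau\theta + \sin z\,\theta_\xi - \psi_\xi = 0. \qquad (36)$$

The zeroth order solution is ($A$ is the amplitude for the stream function disturbance as defined in 3.1)

$$\theta_0 = \frac{\psi_0}{\sin z} = \frac{A}{\sin z}. \qquad (37)$$

The solutions for the first and the second order are:

$$\theta_1 = \frac{\psi_1}{\sin z}, \qquad \theta_2 = \frac{\psi_2}{\sin z}. \qquad (38)$$

The third order solution takes the following form:

$$\theta_{3\xi} = \frac{\psi_{3\xi}}{\sin z} - \frac{\psi_{0\tau}}{\sin^{2} z}. \qquad (39)$$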


As the background shear flow goes to zero at $z = 0$, the "outer" solutions shown above are no longer regular there. We therefore need a different scaling around $z = 0$, and we remove the singularity by matching the above "outer" solution to an "inner" solution, derived in the following with the new scaling. The new scaling is as follows: around $z = 0$ we set $z = \epsilon^{3}Z$ and $\theta = \epsilon^{-3}\Theta$. The rescaled, linearized equation takes the following form:

$$\partial_\tau\Theta + Z\,\Theta_\xi - \psi_\xi = 0, \qquad (40)$$

where we have replaced $\sin z$ with $\epsilon^{3}Z$. To perform the matching between inner and outer solutions, we first write the inner solution $\Theta$ as


(41) 
where $A$ is the stream function amplitude and $B$ is to be determined by matching the inner solution to the outer solution. We then express the outer solution (full solution to the third order) in terms of the rescaled coordinate $Z$ inside the internal boundary layer:


9 


A  ,  RoA^sinz  ,  2Ro^i? sinz  +  Ro2A^ cos z  +  A2  ,  i/>3  C 

=  — - -  -f*  £  -  6  ”  i  6  [  ,  .  o 


smz 

A 


sin  z 


+  +  e2[RoAi^  + 


smz 

Ro2A|(1  -  e6)  +  ^2n  3/ 


sm  z  sir  z 

C 


e3Z  ■  '  e3Z  ^  ve3Z  e6Z2 

where $C_\xi = A_\tau$. The leading order term in equation 43 (order $\epsilon^{-3}$)


), 


2,).  («) 

(43) 


(44) 


gives us the undetermined $B$ as follows:

$$B_\xi + A_\tau = 0. \qquad (45)$$


Having shown how the asymptotic matching works in the internal boundary layer, we press on to find the consistent scaling for the Peclet number. Adopting the same scaling for the inner solution as above, we have to take $Pe$ to be of order $\epsilon^{-10}$ for the diffusive term to appear at the first order in the equation for $\Theta$ inside the internal boundary layer:

$$\partial_\tau\Theta + Z\,\Theta_\xi - \psi_\xi = \frac{1}{P_{10}}\,\Theta_{ZZ}. \qquad (46)$$

We first note that the boundary conditions for the above linear equation have to be found by matching the inner solution to the outer solution. Secondly, we note that the zeroth order term $\Theta_0$ has non-trivial $Z$ and $\xi$ dependence, in contrast to what we found for the $Pe \sim O(1)$ cases.


4.2  Amplitude  equation  and  the  dispersion  relation 

The previous analysis shows that, in the limit of large Peclet number, $\Theta$ depends on $Z$ as well as $\xi$ and $\tau$. With this in mind, we proceed from equations 12 and 13 to write down the amplitude equation for the internal boundary layer. First we note that, as the strength of the stratification is put to order $\epsilon^{6}$, the solutions for the stream function obtained in section 3.1 are still valid in the internal boundary layer. We then only need to concentrate on the heat equation for $\theta$:

$$\partial_t\theta - J(\psi, \theta) + \sin z\,\partial_x\theta - \partial_x\psi = \frac{1}{Pe}\nabla^{2}\theta. \qquad (47)$$

Adopting the scaling $\partial_t = \epsilon^{4}\partial_\tau$, $\partial_x = \epsilon\,\partial_\xi$, $\partial_z = \epsilon^{-3}\partial_Z$ and $\theta = \epsilon^{-3}\Theta$, the above equation takes the following form:

$$\epsilon\,\partial_\tau\Theta - \epsilon^{-5}J(\psi, \Theta) + \epsilon\,(Z\,\Theta_\xi - \psi_\xi) = \frac{\epsilon}{P_{10}}\,\Theta_{ZZ}. \qquad (48)$$

Rescaling $\Theta = \epsilon^{6}\Theta'$, $\psi = \epsilon^{6}\psi'$ and dropping the primes, we arrive at the following equation for the temperature disturbance inside the internal boundary layer:

$$\partial_\tau\Theta - J(\psi, \Theta) + Z\,\Theta_\xi - \psi_\xi = \frac{1}{P_{10}}\,\Theta_{ZZ}. \qquad (49)$$
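The powers of $\epsilon$ in the rescaled heat equation can be checked term by term. The sketch below assumes the scaling stated in the text ($\partial_t = \epsilon^{4}\partial_\tau$, $\partial_x = \epsilon\,\partial_\xi$, $\partial_z = \epsilon^{-3}\partial_Z$, $\theta = \epsilon^{-3}\Theta$, $\sin z \approx \epsilon^{3}Z$) and writes $Pe = \epsilon^{-10}P_{10}$, where the $O(1)$ constant $P_{10}$ is the notation assumed here for the rescaled Peclet number:

```latex
\begin{align*}
\partial_t\theta
  &= \epsilon^{4}\,\epsilon^{-3}\,\partial_\tau\Theta
   = \epsilon\,\partial_\tau\Theta, \\
J(\psi,\theta) = \psi_x\theta_z - \psi_z\theta_x
  &= \epsilon\cdot\epsilon^{-3}\cdot\epsilon^{-3}\,
     \bigl(\psi_\xi\Theta_Z - \psi_Z\Theta_\xi\bigr)
   = \epsilon^{-5}\,J(\psi,\Theta), \\
\sin z\,\partial_x\theta
  &\approx (\epsilon^{3}Z)\,\epsilon\,\epsilon^{-3}\,\Theta_\xi
   = \epsilon\,Z\,\Theta_\xi, \\
\partial_x\psi
  &= \epsilon\,\psi_\xi, \\
\frac{1}{Pe}\,\partial_z^{2}\theta
  &= \frac{\epsilon^{10}}{P_{10}}\,\epsilon^{-6}\,\epsilon^{-3}\,\Theta_{ZZ}
   = \frac{\epsilon}{P_{10}}\,\Theta_{ZZ}.
\end{align*}
```

The horizontal part of the Laplacian, $\partial_x^{2}\theta = \epsilon^{2}\epsilon^{-3}\Theta_{\xi\xi}$, is smaller than the vertical part by a factor $\epsilon^{8}$ and is dropped. With $Pe$ of order $\epsilon^{-10}$, vertical diffusion enters at the same order ($\epsilon$) as the time derivative and the advection by the background shear.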

The stream function amplitude $A$ satisfies the same equation 28, except that inside the internal boundary layer the average of $\theta_x$ over $Z$ is not simply $\Theta_x$, as $\Theta$ depends on $Z$ as well. Also, we have to rescale $A$ accordingly inside the internal boundary layer, so all the nonlinear terms in equation 28 drop out and we obtain the following equation:

3  f“°°” 

9tA^  =  -Ro(-A.££  +  -  Fq  j  *  @£<iZ,  (50) 

where the integral range ("$-\infty$", "$\infty$") refers to the scaled internal boundary layer. To zeroth order in $\epsilon$, we obtain the following equation:

$$\partial_\tau\Theta - A_\xi\,\Theta_Z + Z\,\Theta_\xi - A_\xi = \frac{1}{P_{10}}\,\Theta_{ZZ}, \qquad (51)$$

where $A = A(\xi, \tau)$ is the amplitude for the stream function. Equations 50 and 51 are the amplitude equations for the internal boundary layer. The linear equations for the internal boundary layer are

drA^  =  — Ro(-A££  +  A) — Fq  j  ©^ dZ ,  (52) 

$$\partial_\tau\Theta = -Z\,\Theta_\xi + A_\xi + \frac{1}{P_{10}}\,\Theta_{ZZ}. \qquad (53)$$

We first derive the dispersion relation for the infinite Peclet number case. Replacing $\partial_\xi$ with $ik$ and $\partial_\tau$ with $s$, we obtain the following equations:

-k2[s  +  ^2. k 4  -  R 0k2]A  =  -F6  r  ik©dZ ,  (54) 

2  J -oo 

$$(s + iZk)\,\Theta = ikA. \qquad (55)$$

Substituting  equation  55  into  54,  we  obtain  the  dispersion  relation  for  the  infinite  Peclet 
number  case: 

s  =  —j^sgn{s)n  -  ^-pA;4  +  Rofc2.  (56) 

1*1  2 
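The factor $\mathrm{sgn}(s)\,\pi/|k|$ in equation 56 comes from integrating the normal-mode solution of equation 55 across the layer: for real, nonzero $s$, the integral of $1/(s + ikZ)$ over a symmetric range in $Z$ tends to $\pi\,\mathrm{sgn}(s)/|k|$ as the range grows. The following is a short numerical check of that step, a sketch assuming real $s$ and a large but finite symmetric cutoff; the function names and the default cutoff are illustrative choices, not taken from the report.

```python
import numpy as np

def layer_integral(s, k, cutoff=2000.0, n=400001):
    """Trapezoidal estimate of the integral of 1/(s + i k Z) over [-cutoff, cutoff]."""
    z = np.linspace(-cutoff, cutoff, n)
    f = 1.0 / (s + 1j * k * z)
    dz = z[1] - z[0]
    # Composite trapezoid rule, written out to avoid version-specific NumPy helpers.
    return dz * (f.sum() - 0.5 * (f[0] + f[-1]))

def closed_form(s, k):
    """Limit of the symmetric integral for real, nonzero s."""
    return np.pi * np.sign(s) / abs(k)

if __name__ == "__main__":
    for s, k in [(0.5, 1.0), (-0.5, 1.0), (0.3, -2.0)]:
        print(s, k, layer_integral(s, k), closed_form(s, k))
```

The imaginary part of the integrand is odd in $Z$ and cancels on the symmetric range, while the real part integrates to an arctangent that saturates at $\pm\pi/|k|$. The sign change with $s$ is what distinguishes growing from decaying modes in the infinite Peclet number dispersion relation.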

In the case of finite, large Peclet numbers, we expand $\Theta$ in both $e^{ikx}$ and $e^{iqz}$ and obtain the following equation

—  (s  +  kq2)©  +  09  =  -2niA6(q),  (57) 
where $\Theta$ and $A$ are Fourier components in the $k$-$q$ spectral space. The solution to equation 57 is


_  r  oo 

©  =  e(s9+fc9  /3)/fc  /  2niAS{q')e-(~sq'+kq  /3)/kdq' 

Jq 

=  e{sq+kq3/3)/k2niAH{-q), 


(58) 

(59) 


where $H$ is the Heaviside function. Substituting the above normal mode solution for $\Theta$ into 52, we get the dispersion relation for the large, finite Peclet number case:

5  =  -^7r-^£A:4  +  Rofc2.  (60) 

1*1  2 

We note that the difference between equations 56 and 60 is the presence of sgn($s$), and we display the two dispersion relations in figure 12.

Figure 12: Dispersion relation for the internal boundary layer ($Ro = \sqrt{2}$). Curves are labeled by the scaled Richardson number $F_6$: from curve 1 to curve 4, $F_6$ is, respectively, 0, 0.005, 0.02, 0.045. The solid lines are for the infinite Peclet number cases, and the dotted lines are for large Peclet number cases.

We also note that for any given Reynolds number, the value of $F_6$ at which the maximum growth rate $s$ is zero is proportional to the Reynolds number, with ratio 0.0322. We are now ready to find the general solution for $\Theta$ for large Peclet number cases. We rewrite equation 53 as follows:

$$\kappa\,\Theta_{ZZ} - (s + ikZ)\,\Theta = ikA, \qquad (61)$$

where $\kappa = 1/P_{10}$. Dividing the above equation by $ikA$ and denoting $f = \Theta/ikA$, we obtain the following equation, which allows us to find a closed-form solution:

$$\kappa f_{ZZ} - ik(Z - is/k)\,f = 1. \qquad (62)$$

The  solution  is  the  Yi  function: 


0  =  — Yi[k{Z -is/k)]. 


(63) 
Figure 13: Dispersion relation for the internal boundary layer ($Ro = \sqrt{2}$). Dashed lines are for $O(1)$ Peclet numbers and solid lines are for large Peclet numbers. Curve '1' is for $F_6 = 0$, curves '2' and 'a' are for $F_6 = 0.001$, curves '3' and 'b' are for $F_6 = 0.01$, curves '4' and 'c' are for $F_6 = 0.03$, and curves '5' and 'd' are for $F_6 = 0.045$.

5  Inviscid  limit  of  the  stratified,  Kolmogorov  shear  flow 

In  this  section  we  conclude  the  report  by  presenting  a  brief  study  on  the  inviscid  limit  of  the 
stratified  Kolmogorov  shear  flow.  A  general  review  on  linear  analysis  of  the  inviscid  shear  flow 
can  be  found  in  [8].  Here  we  provide  a  way  to  find  neutral  states  for  the  unbounded,  stratified 
Kolmogorov  shear  flow.  We  have  numerically  verified  the  marginal  boundary  presented  in 
the  following  analyses,  and  it  would  be  an  interesting  direction  to  provide  analytic  proofs 
that this is indeed the case. We should also point out that there may be hope to couple the critical layers (CL) associated with each inflection point in the background shear flow, and hence the interaction between CLs can be investigated.

5.1  Linear  analysis:  analytical  and  numerical 

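The inviscid, nondiffusive system ($1/Pe = 0$) is described as follows (where the shear flow is a sinusoidal $\sin z$ in a periodic domain):

$$\partial_t\nabla^{2}\psi - J(\psi, \nabla^{2}\psi) + \sin z\,\partial_x(\nabla^{2}\psi + \psi) = -F\,\partial_x\theta, \qquad (64)$$

$$\partial_t\theta - J(\psi, \theta) + \sin z\,\partial_x\theta - \partial_x\psi = 0. \qquad (65)$$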

The linearized equations can be combined into the following equation with the diffusivity being zero:

$$(\sin z - c)(D^{2} - k^{2})\psi + \sin z\,\psi = -\frac{F\psi}{\sin z - c}, \qquad (66)$$

where $D = \partial_z$ and $c$ is the growth rate divided by the wave number. In this notation, the imaginary part of $c$ indicates instability: a positive imaginary part means a growing mode and a negative imaginary part means a decaying mode. We reorganize the above equation into a more familiar form usually found in the past literature:

$$\psi'' - k^{2}\psi + \frac{\sin z\,\psi}{\sin z - c} + \frac{F\psi}{(\sin z - c)^{2}} = 0. \qquad (67)$$


First we put $c = 0$, though in general only the imaginary part is required to be zero on the neutral curve. Equation 67 now takes the following form:

$$D^{2}\psi + \left(1 - k^{2} + \frac{F}{\sin^{2} z}\right)\psi = 0. \qquad (68)$$

The above equation can be solved as follows: first we write the left-hand side of equation 68 as the product of two differential operators:

$$\left(D^{2} + 1 - k^{2} + \frac{F}{\sin^{2} z}\right)\psi = \mathcal{L}\mathcal{L}^{\dagger}\psi = 0, \qquad (69)$$

where $\mathcal{L}$ and $\mathcal{L}^{\dagger}$ are defined as

$$\mathcal{L} = D + a\cot z, \qquad \mathcal{L}^{\dagger} = D - a\cot z, \qquad (70)$$

and $a$ is to be determined (in terms of $k$ and $F$). We note that $\mathcal{L}\mathcal{L}^{\dagger} = D^{2} - (a f' + a^{2} f^{2})$, where $f = \cot z$, and the relationships between $a$, $k$, and $F$ are obtained as follows:

$$a^{2} = 1 - k^{2}, \qquad F = a - a^{2} = \sqrt{1 - k^{2}} - (1 - k^{2}). \qquad (71)$$

The first solution $\psi_1$ is obtained by demanding $\mathcal{L}^{\dagger}\psi_1 = 0$ and takes the following form:

$$\psi_1 = (\sin z)^{\sqrt{1 - k^{2}}}, \qquad z > 0. \qquad (72)$$

The second solution $\psi_2$ satisfies the following equation

$$\mathcal{L}^{\dagger}\psi_2 = (\sin z)^{-\sqrt{1 - k^{2}}}, \qquad (73)$$

and is obtained as follows:

$$\psi_2 = (\sin z)^{\sqrt{1 - k^{2}}} \int^{z} (\sin z')^{-2\sqrt{1 - k^{2}}}\,dz'. \qquad (74)$$

For some values of $k$, the second solution is not periodic in $z$ and therefore is not of particular interest. $F$, as a function of $k$, is shown in figure 14. First we note that the maximum value of $F$ is $1/4$, attained when $k = \sqrt{3/4}$. We also note that $k$ goes from 0 to 1, as we are only interested in positive $F$.
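The neutral-curve relation and the first solution can be checked numerically. The sketch below assumes the relations $a = \sqrt{1 - k^2}$ and $F = a - a^2$ of equation 71, and tests $\psi_1 = (\sin z)^a$ against equation 68 with a central-difference second derivative; the function names and the step size `h` are illustrative choices, not from the report.

```python
import math

def neutral_F(k):
    """F on the neutral curve: F = a - a^2 with a = sqrt(1 - k^2)."""
    a = math.sqrt(1.0 - k * k)
    return a - a * a

def residual(k, z, h=1e-4):
    """Residual of psi'' + (1 - k^2 + F / sin^2 z) psi for psi = (sin z)^a."""
    a = math.sqrt(1.0 - k * k)
    psi = lambda x: math.sin(x) ** a
    # Second derivative by central differences (valid for 0 < z < pi).
    psi_zz = (psi(z + h) - 2.0 * psi(z) + psi(z - h)) / (h * h)
    return psi_zz + (1.0 - k * k + neutral_F(k) / math.sin(z) ** 2) * psi(z)

if __name__ == "__main__":
    ks = [i / 1000.0 for i in range(1001)]
    k_best = max(ks, key=neutral_F)
    print("max F on grid:", neutral_F(k_best), "at k =", k_best)
    print("residual at k = 0.6, z = 1.0:", residual(0.6, 1.0))
```

The residual stays at the level of the finite-difference error, and the grid search places the maximum of $F$ near $k = \sqrt{3/4}$, in line with the value $1/4$ quoted in the text.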
Figure 14: Stability boundary for the inviscid, non-diffusive limit.

6  Conclusion 

We have investigated the effect of stabilizing stratification on the Kolmogorov shear flow in the weak limit where the long wavelength instability is inherited from the non-stratified shear flow. Concentrating on cases where $Pe \sim O(1)$, we first derived amplitude equations for the weakly stratified Kolmogorov shear flow and demonstrated the stabilizing effects by numerically solving the amplitude equations. For the 1-D amplitude equation, the stabilizing gradient arrests the inverse cascade and weakens the flow. For the uni-directional amplitude equation, the gradient not only weakens the flow, but also diminishes the chaotic behavior of the uni-directional solution. The same phenomena have been observed for the large aspect ratio 2-D solutions to the full amplitude equation. For the nondiffusive limit ($Pe \sim \epsilon^{-10}$), the dynamics are dominated by the internal boundary layer. From the linear eigenfunctions, we are able to estimate an empirical scaling of the boundary layer thickness with the Peclet number. We choose the boundary layer scaling accordingly and derive amplitude equations for the internal boundary layer. Dispersion relations are derived and utilized for some preliminary analysis. The linear stability of the stratified, inviscid Kolmogorov shear flow has been investigated as a preliminary step to the weakly nonlinear analysis, which is now under investigation.
What Comes Around Goes Around:
A Bug's Life

Jennifer  Curtis 


1  Introduction 

Many biological phenomena such as the spread of a favored gene, the population growth of species, ecological competition, and others are described by equations that contain the dominant physical processes of diffusion, convection, and a background reaction [6]. The model studied in this paper describes the limited universe of these forces and its influence on the lives and deaths of one colony of photosynthetic bacteria living on an inhomogeneous substrate.

Interestingly,  the  model  reaches  beyond  the  scope  of  biology  to  touch  upon  unexpected 
research  areas  in  condensed  matter  physics,  including  vortices  in  superconductors  [7]  and 
semiconductor  physics  [5].  The  bacteria  and  vortex  systems  are  mathematically  analogous, 
non-Hermitian  models  which  have  drawn  a  great  deal  of  interest  [1],[3]  due  to  their  ability  to 
undergo  a  delocalization  transition  in  their  eigenfunctions.  Previously  such  transitions  were 
believed  to  be  impossible  in  one  or  two  dimensional  systems  [5]. 

The goal of this paper is to continue the analysis of Dahmen, Nelson, and Shnerb (DNS) [4], [2] by studying the delocalization transition in the presence of a weakly non-linear saturation term. This term represents the crowding of the bacteria due to competition or deadly concentrations of toxins from their waste.

The model is new territory for the mathematical analysis of pattern formation and population dynamics in biology. With the exception of DNS, very little work has been done on this type of inhomogeneous system. Our analysis leads to the very interesting result that the dynamics of the model is governed by a differential-delay equation. This delay equation is explored with the hope that oscillations, quasi-periodicity or chaos might arise within the physical regime of the model. A little familiarity with delay equations and a comparison with another familiar delay equation of mathematical biology, the Glass-Mackey delay equation [6], suggests a structural method to quickly predict the dynamic behavior of a subset of such equations.

Finally, several efforts are made to reasonably modify the physical system to achieve interesting dynamics. These attempts include changing the form of the non-linear saturation and increasing the spatial complexity of the system. The former is proved to be stable, while the latter remains an open-ended question.
2 The Modified Fisher Equation


Imagine one species of bacteria living in a periodic, one-dimensional ring with coordinates in $x$, $0 < x < L$. Located at the origin is an "oasis" where plentiful supplies of food and light support life. The rest of the ring is deadly to the bacteria. DNS call this region a "desert". The bacteria experience diffusion, as well as a convective drift due to a background current. They also compete with their neighbors as mentioned above. This is the source of the non-linear term neglected by DNS. The equation governing this model is the Fisher equation with an extra term included to account for convection. Originally, the Fisher equation was proposed as a model for the spread of a favored gene. The modified Fisher equation is then

$$c_t(x,t) + u\,c_x(x,t) = D\,c_{xx}(x,t) + [\beta\delta(x) - a]\,c(x,t) - b\,c^{2}(x,t), \qquad (1)$$

where $c(x,t)$ is the concentration of the bacteria. A delta function of strength $\beta$ represents the oasis and the $-a$ term represents the death rate of the desert. The combination $\beta\delta(x) - a$ is the spatial inhomogeneity of this particular system.
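To make the roles of the terms in equation 1 concrete, the following is a minimal finite-difference sketch on a periodic ring. It is not the method used in this paper: the explicit Euler time step, the central differences, the representation of the delta function as a single grid cell of height $\beta/\Delta x$, and all parameter values are assumptions chosen only for illustration.

```python
import numpy as np

def simulate_ring(beta, u=0.5, D=1.0, a=0.1, b=1.0,
                  length=20.0, n=200, t_end=20.0, c0=1e-3):
    """Integrate c_t + u c_x = D c_xx + [beta delta(x) - a] c - b c^2 on a ring."""
    dx = length / n
    dt = 0.2 * dx * dx / D          # well inside the explicit diffusion limit dx^2 / (2 D)
    growth = np.full(n, -a)         # desert: death rate a everywhere
    growth[0] += beta / dx          # oasis: delta function spread over one grid cell
    c = np.full(n, c0)              # small uniform initial population
    for _ in range(int(round(t_end / dt))):
        # np.roll implements the periodic boundary of the ring.
        c_x = (np.roll(c, -1) - np.roll(c, 1)) / (2.0 * dx)
        c_xx = (np.roll(c, -1) - 2.0 * c + np.roll(c, 1)) / (dx * dx)
        c = c + dt * (D * c_xx - u * c_x + growth * c - b * c * c)
    return c

if __name__ == "__main__":
    with_oasis = simulate_ring(beta=2.0)
    no_oasis = simulate_ring(beta=0.0)
    print("peak concentration with oasis:", with_oasis.max())
    print("peak concentration without oasis:", no_oasis.max())
```

With a sufficiently strong oasis the small initial population grows and saturates near the origin because of the $-bc^2$ term, while without the oasis the population simply decays at the desert death rate.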

DNS's biological motivation for suggesting this model was to study the effect of spatial inhomogeneities in the underlying medium. Disorder in the medium may be due to many things, including random diffusion constants, stochastic growth and death rates, or a random concentration of environmental factors such as food, toxins or illumination. Here we use the simplest choice, a random concentration of food and/or illumination.

A possible experiment suggested by DNS is to place the bacteria in a thin annular ring covered by a dark mask with a small slot cut to let light pass through. Turning the mask at a slow speed while the ring remains fixed would simulate the convection current. Currently DNS are talking with experimental biologists about doing this experiment. Another practical system where this model may be applied is the circumpolar current around Antarctica. It has been shown to carry photosynthetic plankton completely around the continent, with various patches of nutrient-rich upwellings supporting the plankton.


3  Linear  Stability  Analysis 

The following section reviews the analysis of the linearized modified Fisher equation. Here we become familiar with the delocalization transition that occurs in this biological model and with the associated behavior of the eigenspectrum. Linearizing about the fixed point $c = 0$ leads to

$$c_t(x,t) + u\,c_x(x,t) = D\,c_{xx}(x,t) + [\beta\delta(x) - a]\,c(x,t). \qquad (2)$$

3.1 Without the oasis: An example of delocalized modes

Delocalized eigenfunctions are those solutions which have a form like $e^{ikx}$, where $k$ is complex and the real part is non-zero. The simplest example of delocalization occurs if $\beta = 0$. Solutions are of the form $c(x,t) = e^{st}e^{ikx}$. Periodicity requires that $k$ be quantized as $k = 2\pi m/L$, where $m$ is an integer.
Figure 1: Regime diagram for an infinite ring


The  associated  dispersion  relation  is 

$$s = -Dk^{2} + a - iuk. \qquad (3)$$

The growth rate, $s$, takes a discrete set of complex values. The plot of Re($s$) versus Im($s$) is a parabola symmetric about the real axis. Increasing the velocity $u$ broadens the parabola. When $u = 0$, the growth rate is real. The value of $a$ determines the stability of the eigenfunctions. The growth rate of the $m$th eigenfunction will be positive if

$$a > D\left(\frac{2\pi m}{L}\right)^{2}. \qquad (4)$$

The eigenfunction with the largest positive growth rate ($k = 0$) will dominate the system at large times.

3.2  An  infinite  ring 

We begin the linear analysis of our system for the case of an infinite ring because it has simple analytical results which clearly demonstrate the signature of delocalization. Assume $c(x,t) = e^{st}c(x)$ to eliminate the time dependence in (2),

$$s\,c(x) + u\,c_x(x) = D\,c_{xx}(x) + [\beta\delta(x) - a]\,c(x). \qquad (5)$$

Equation (2) may also be written as

$$c_t = \mathcal{L}c, \qquad (6)$$

where the linear operator,

$$\mathcal{L}c = D\,c_{xx} - u\,c_x + [\beta\delta(x) - a]\,c, \qquad (7)$$

generates the time-evolution of the system. When $u = 0$, the operator is Hermitian with real eigenvalues, and for strong enough disorder, all of its eigenfunctions are real and localized

Pi- 

Using $c(x) = e^{-kx}$, a dispersion relation is found that has two roots for the wavenumber $k$,

$$Dk^{2} + uk - s - a = 0,$$

$$k_\pm = \frac{-u \pm \sqrt{u^{2} + 4D(s + a)}}{2D}. \qquad (8)$$

Periodicity is satisfied only if the eigenfunction $c(x) = e^{-kx}$ decays as $x \to \pm\infty$. Thus we use $k_+$ when $x > 0$ and $k_-$ when $x < 0$. This restriction is the equivalent of solving for only the localized eigenfunctions of this physical system, i.e. we are working in the regime where $k_\pm$ is real.

Using the appropriate eigenfunction to the left and right of the origin, we integrate (patch) across the delta function to acquire a value for the growth rate $s$:

$$D\,[c_x]_{0^-}^{0^+} + \beta\,c(0) = 0,$$

$$D\,[-k_+ - (-k_-)] = -\beta,$$

$$\Rightarrow\quad s = \frac{\beta^{2} - u^{2}}{4D} - a. \qquad (9)$$

The expression for $k_\pm$ (8) can now be simplified using the growth rate (9),

$$k_\pm = \frac{-u \pm \beta}{2D},$$

$$\Rightarrow\quad \beta > u \ \text{ when } \ u > 0,$$

$$\beta > -u \ \text{ when } \ u < 0. \qquad (10)$$

The requirements on $k_\pm$ restrict the range of $u$, according to the given strength of the oasis $\beta$. We have one localized solution, although if the oasis were wider than a delta function, say a box, there would be many localized solutions.

A regime diagram (figure 1) maps the properties of this system as $\beta$ and $u$ vary. The two straight lines, $\beta = u$ and $\beta = -u$, are the boundaries which restrict $k_\pm$. If they are crossed, the eigenfunctions will be in the delocalized regime.

The marginal stability curve is the hyperbola labeled $s = 0$. As the parameters $u$ and $\beta$ tend to infinity, the marginal stability curve approaches the delocalization transition. Inside the hyperbola, $s$ is positive, and thus the eigenfunctions are unstable and grow in time. Outside the hyperbola, $s$ is negative, and the eigenfunctions are stable and decay with time.

Larger values of $a$ and $|u|$ shift the marginal stability curve upwards, increasing the regime of stability. This is because $a$ is the death rate, while in an infinite ring larger velocities carry more bacteria away from the oasis, which is a sure death sentence.
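The regime diagram can be summarized in a few lines of code. The sketch below assumes the infinite-ring results of equations 9 and 10; the function names and the regime labels are illustrative choices made here.

```python
def growth_rate(beta, u, D, a):
    """Growth rate of the localized mode, s = (beta^2 - u^2) / (4 D) - a."""
    return (beta ** 2 - u ** 2) / (4.0 * D) - a

def decay_rates(beta, u, D):
    """Return (k_plus, k_minus) = ((-u + beta) / (2 D), (-u - beta) / (2 D))."""
    return (-u + beta) / (2.0 * D), (-u - beta) / (2.0 * D)

def regime(beta, u, D, a):
    """Classify a point of the (u, beta) plane for the infinite ring."""
    if beta <= abs(u):
        # Outside the wedge beta > |u| the localized ansatz no longer decays.
        return "delocalized"
    # The marginal case s = 0 is grouped with the stable side here.
    return "localized, unstable" if growth_rate(beta, u, D, a) > 0 else "localized, stable"

if __name__ == "__main__":
    print(regime(1.0, 0.99, 1.0, 0.0))
    print(regime(1.0, 1.5, 1.0, 0.0))
    print(regime(1.0, 0.0, 1.0, 1.0))
```

The growth-rate formula applies only inside the wedge $\beta > |u|$, so the classification checks localization first and evaluates $s$ only for localized modes.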
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Figure  2:  c(x)  for  positive  velocities 

The behavior of the concentration c(x) when there is a positive background velocity is examined in figure (2). The plots are asymmetric due to the eigenfunction dependence on the sign of u:

c(x±) = .  (11)

The velocity blows bacteria away from the origin to the right, increasing their concentration in this region. To the left of the origin, the competing mechanisms of diffusion away from the oasis and advection back into the oasis result in a thin boundary layer. If no wind or current were present, the distribution would be symmetric, decaying exponentially away from the origin.

As the value of the velocity increases, the concentration becomes nearly constant across x because more bacteria are being blown out of the origin. For this particular example, choices of the velocity u are limited by the choice of $\beta = 1$. At u = 0.99, for instance, the eigenfunction is nearly delocalized. If we surpassed u = 1 we would be examining the delocalized spectrum. The one localized mode is unstable if

$$\beta^2 > 4Da + u^2 . \qquad (12)$$

This section has been included to introduce the problem and gain some intuition for the delocalization transition, and its dependence on the physics (D, $\beta$, u) of the system. Note that for this infinite ring case, the velocity carries the bacteria away from their haven, never returning them in time before they die. The velocity has a purely deadly effect. Thus, the inequality above, a requirement for instability, makes sense. Only if the life production in the oasis, $\beta$, is large enough to overcome the deadly effects of diffusion and convection, will the system grow in time.
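
The two conditions discussed in this section can be collected in a few lines of Python. This is only a sketch: the function names are hypothetical, and the growth rate $s = (\beta^2 - u^2)/4D - a$ is an assumption, chosen because its sign change reproduces the instability inequality above.

```python
def is_localized(beta, u):
    """A localized mode exists only inside the wedge |u| < beta of figure 1."""
    return abs(u) < beta


def growth_rate(beta, u, D, a):
    """Assumed growth rate of the localized mode; s > 0 matches the inequality."""
    return (beta**2 - u**2) / (4.0 * D) - a


def is_unstable(beta, u, D, a):
    """Instability condition: beta^2 > 4 D a + u^2."""
    return beta**2 > 4.0 * D * a + u**2
```

For example, with $\beta = 1$ the mode is still localized at u = 0.99 but not at u = 1.5.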

3.3  A  finite  ring  with  no  convective  drift 

We  now  study  the  linearized  problem  in  a  finite  ring  with  no  wind.  When  u  =  0,  (5)  becomes 

$$sc = Dc_{xx} + [\beta\delta(x) - a]c . \qquad (13)$$

Figure 3: (a) $\sigma = \beta L/(2D)$ vs $\kappa L$ when u = 0 in the finite ring. (b) Roots of $x(1 - \cos x) + \sigma\sin x = 0$. Symbols: * for $\sigma = 0$; + for $\sigma = 5$; o for $\sigma = 15$; x for $\sigma = 25$; < for $\sigma = 35$.

We arrange that the unknown concentration c(x) will be equal to 1 at x = 0. The eigenfunctions will then be of the form

$$c(x) = (1 - A)e^{-kx} + Ae^{kx} . \qquad (14)$$

Using periodicity, c(0) = c(L), we find an expression for A:

$$A = \frac{1 - e^{-kL}}{2\sinh kL} . \qquad (15)$$

Combining this result with (14) gives the expression for c(x):

$$c(x) = e^{-kx} + \frac{1 - e^{-kL}}{\sinh kL}\sinh kx . \qquad (16)$$

Patching across the delta function gives a transcendental relation for the wavenumber k:

$$2k(1 - \cosh kL) = -\frac{\beta}{D}\sinh kL . \qquad (17)$$

This is the same expression found for k by DNS [4], if one sets their velocity, v, equal to zero. Their result was found by solving the problem for a periodic domain with an oasis that is a finite square well, and then taking the area of the well to zero.

To study the delocalized spectrum, this expression can be neatly rewritten by letting $k = i\kappa$ and $\sigma = \beta L/(2D)$:

$$\kappa L(1 - \cos\kappa L) + \sigma\sin\kappa L = 0 . \qquad (18)$$

In this form we assume that the wavenumber k is purely imaginary. The spectrum may be studied graphically by plotting $\sigma$ as a function of $\kappa L$ (see figure 3(a)). The wavenumbers $\kappa$ for fixed $\sigma$ are found by drawing a line at one value of $\sigma$ and intersecting the curves. The numerical results are plotted in 3(b) for various values of $\sigma$.

A double degeneracy exists in the solutions of the finite ring without a delta function ($\sigma = 0$). This degeneracy is not reflected in 3(a) because the dispersion relation was obtained after dividing out one extra factor of cosh(kL). The degeneracy of the eigenvalues is broken by turning on the delta function strength so that $\sigma \neq 0$. As $\sigma$ is increased, one of the two sets of eigenvalues moves away from the general oscillatory solutions $\kappa L = 2\pi m$ of section (3.1). As the strength of the delta function increases, the moving eigenvalues asymptote to $(2m+1)\pi$. The other set of eigenvalues remains fixed at the values $2\pi m$.
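
The splitting of the degenerate pairs can be checked numerically. The sketch below assumes the relation $x(1 - \cos x) + \sigma\sin x = 0$ with $x = \kappa L$, which factors as $2\sin(x/2)\,[x\sin(x/2) + \sigma\cos(x/2)] = 0$: the first factor gives the fixed roots $x = 2\pi m$, and the bracket gives the roots that move with $\sigma$. The function names are hypothetical and bisection is simply one convenient root finder.

```python
import math


def moving_root(sigma, m=0, tol=1e-12):
    """Root of x*sin(x/2) + sigma*cos(x/2) = 0 in ((2m+1)*pi, (2m+2)*pi), sigma > 0."""
    def g(x):
        return x * math.sin(x / 2.0) + sigma * math.cos(x / 2.0)

    lo, hi = (2 * m + 1) * math.pi, (2 * m + 2) * math.pi
    g_lo = g(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        g_mid = g(mid)
        if g_lo * g_mid <= 0.0:
            hi = mid            # sign change lies in [lo, mid]
        else:
            lo, g_lo = mid, g_mid
    return 0.5 * (lo + hi)


def dispersion_lhs(x, sigma):
    """Left-hand side x*(1 - cos x) + sigma*sin x of the delocalized relation."""
    return x * (1.0 - math.cos(x)) + sigma * math.sin(x)
```

For weak oases the moving root sits just below $2\pi(m+1)$, and for strong oases it approaches $(2m+1)\pi$ from above.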

None  of  these  delocalized  modes  are  unstable  in  time,  although  we  will  see  in  the  next 
section,  that  they  can  be  unstable  when  there  is  a  strong  enough  wind  to  help  blow  the 
bacteria  around  the  ring  before  they  die. 

One localized mode exists. The solution can be found analytically if one assumes that $\sigma$ is very large so that $\cosh\sigma \approx \sinh\sigma$. In this limit, we find that $kL \to \sigma$. The localized mode can be unstable depending on the values of the parameters k and a. The dispersion relation for this system is

$$s = Dk^2 - a . \qquad (19)$$
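
For finite $\sigma$ the localized wavenumber has to be found numerically. The sketch below assumes that the patching relation for real k is equivalent to $kL\tanh(kL/2) = \sigma$ with $\sigma = \beta L/(2D)$ (this follows from half-angle identities if the relation carries the factor $\beta/D$ fixed by the jump condition). The function names are hypothetical.

```python
import math


def localized_kL(sigma, tol=1e-12):
    """Solve kL * tanh(kL/2) = sigma for the positive real root kL (sigma > 0)."""
    lo, hi = 0.0, sigma + 2.0   # x*tanh(x/2) - sigma changes sign on this bracket
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mid * math.tanh(mid / 2.0) < sigma:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def localized_growth_rate(beta, D, a, L):
    """Growth rate s = D k^2 - a of the localized mode when u = 0."""
    sigma = beta * L / (2.0 * D)
    k = localized_kL(sigma) / L
    return D * k * k - a
```

For large $\sigma$ the root approaches $kL = \sigma$, in line with the limit quoted above.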


3.4  Finite  length  and  a  constant  wind: 

Traveling  around  the  ring 

We  now  explore  the  full  linearized  problem  in  a  finite  ring.  The  relevant  partial  differential 
equation  is 


$$sc(x) + uc_x(x) = Dc_{xx}(x) + [\beta\delta(x) - a]c(x) . \qquad (20)$$

We begin by assuming a form for the concentration c(x), normalized so that c = 1 at the origin,

$$c(x) = \left[(1 - A)e^{-kx} + Ae^{kx}\right]e^{ux/2D} . \qquad (21)$$

Applying periodicity gives an expression for the constant A:

$$A = \frac{e^{-uL/2D} - e^{-kL}}{2\sinh kL} . \qquad (22)$$

Now, to obtain an expression for k similar to (17), we repeat the patching. The $uc_x(x)$ term adds nothing new since it is zero when integrating over x; however, the derivative of c(x) is now more complicated due to the addition of u/2D in the exponential. Using the definition $\eta = uL/2D$, where $\eta$ is the Peclet number of fluid mechanics, we find

$$2k(\cosh\eta - \cosh kL) = -\frac{\beta}{D}\sinh kL . \qquad (23)$$

Note that if u = 0, it is identical to (17) as it should be. Also, if Re(k) > u and L → ∞ the dispersion relation reduces to (9).

Given the concentration in (21), the dispersion relation is

$$s = Dk^2 - \frac{u^2}{4D} - a . \qquad (24)$$

Figure 4: The movement of the localized mode into the delocalized mode with increasing velocity. Complex s is a signature of delocalization. Horizontal axes: Re(s).


To learn more about this new dispersion relation for k, let $k = i\kappa$ and use the definitions of $\eta$ and $\sigma$:

$$\kappa L(\cosh\eta - \cos\kappa L) + \sigma\sin\kappa L = 0 . \qquad (25)$$

When k is complex, the system is best studied using a numerical algorithm to solve for the growth rate s. The results are very interesting. We find that the eigenspectrum is a parabola in the complex plane, as it was in section (3.1), but now we also have one real eigenvalue whose distance from the parabola depends on the velocity u. This is the one localized mode that accompanies the delta function. As the velocity is increased, the eigenvalue moves towards and finally onto the apex of the parabola, which remains fixed. This represents the same delocalization transition we experienced in section (3.2) when we crossed the boundaries $\beta = \pm u$. Figure 4(b) shows a series of spectra with increasing velocity. Here the critical delocalization velocity, the velocity at which the real eigenvalue moves onto the parabola, is determined by a more complicated relationship between $\beta$ and u which we examine in the next section.
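
One possible numerical approach, offered only as a sketch and not as the algorithm used for figure 4, is a Newton iteration in the complex k plane. It assumes the finite-ring relation in the form $F(k) = 2k(\cosh\eta - \cosh kL) + (\beta/D)\sinh kL = 0$ with $\eta = uL/2D$, and the growth rate $s = Dk^2 - u^2/4D - a$. The function names and the starting guess `k0` are hypothetical.

```python
import cmath


def solve_wavenumber(beta, u, D, L, k0, max_iter=100, tol=1e-12):
    """Newton iteration for a (possibly complex) root k of F(k) = 0."""
    eta = u * L / (2.0 * D)
    k = complex(k0)
    for _ in range(max_iter):
        ch, sh = cmath.cosh(k * L), cmath.sinh(k * L)
        F = 2.0 * k * (cmath.cosh(eta) - ch) + (beta / D) * sh
        # dF/dk, differentiating each term of F with respect to k
        dF = 2.0 * (cmath.cosh(eta) - ch) - 2.0 * k * L * sh + (beta / D) * L * ch
        step = F / dF
        k -= step
        if abs(step) < tol * max(1.0, abs(k)):
            break
    return k


def growth_rate(k, u, D, a):
    """Growth rate s = D k^2 - u^2/(4D) - a for a given wavenumber k."""
    return D * k * k - u * u / (4.0 * D) - a
```

Starting from the infinite-ring value $k_0 = \beta/2D$ picks out the localized mode when it exists; complex starting guesses would be needed to trace the delocalized branch.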

3.5  An  important  limit  for  the  finite  ring  with  wind 

Here we present a nice way to represent the delocalization transition for a large but finite ring. This is a new addition to the analysis done by DNS.

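In the limit that L is large, (23) reduces to the dispersion relation

$$k\left(1 - e^{\eta - k}\right) = \frac{\beta L}{2D} , \qquad (26)$$

where k is now measured in units of 1/L.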

We define a parameter P, which in the limit that L → ∞, is a measure of our closeness to the delocalization threshold:

$$P = \frac{\beta}{u} - 1 . \qquad (27)$$

In order to examine the complex eigenvalue spectrum, we rewrite k in terms of a complex parameter $\zeta$,

$$k = \eta[1 + P\zeta] , \qquad (28)$$

where, for eventual simplicity, $\zeta$ is

$$\zeta = 1 + \frac{x + iy}{\eta P} . \qquad (29)$$

Then k becomes

$$k = \eta(1 + P) + x + iy . \qquad (30)$$


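Putting k (28) into the dispersion relation (26) and simplifying gives the expression

$$e^{-\eta P\zeta} = 1 - \frac{P + 1}{1 + P\zeta} . \qquad (31)$$

Making the assumption that $P\zeta \ll 1$, we have a final expression for $\zeta$:

$$\zeta = 1 + \frac{1}{P}e^{-\eta P}e^{-x - iy} . \qquad (32)$$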


Using the definitions for $\zeta$ (29, 32) and plugging them into k (28) gives a nice expression for the wavenumber:

$$k = \eta\left[1 + P + e^{-\eta P}e^{-x - iy}\right] . \qquad (33)$$

Another useful relation is obtained by combining (30, 33),

$$x + iy = p\,e^{-x - iy} , \qquad (34)$$

where we have used the definition of p,

$$p = \eta e^{-P\eta} . \qquad (35)$$


Setting real and imaginary parts equal, we have two expressions for x and y:

$$x = pe^{-x}\cos y , \qquad (36)$$

$$y = -pe^{-x}\sin y . \qquad (37)$$

A little manipulation of these expressions gives the final form that we use to explore the delocalization transition:

$$x^2 + y^2 = p^2e^{-2x} , \qquad (38)$$

$$\frac{y}{x} = -\tan y . \qquad (39)$$




Figure 5: (a) A graphical solution of (37). The solid curve is $xe^{x}$ and the dashed curves show $p\cos\sqrt{p^2e^{-2x} - x^2}$ for two values of p = [2, 6] * exp(-1). (b) Eigenvalue loci obtained from (39) for various values of p. The dotted curve is p = 1/e and the multi-branched dashed curves are loci with p < 1/e. The solutions obtained from the intersections of figure 5(a) can be placed on the appropriate curves above.


Figure (5)(b) plots contours of p in the x-y plane. The pair of values (x, y) for each wavenumber k is found by solving

$$x = pe^{-x}\cos\sqrt{p^2e^{-2x} - x^2} \qquad (40)$$

for a given p. The most unstable mode (the largest solution) is $x_0$, which is determined by $x_0e^{x_0} = p$. The corresponding $y_0$ is always zero, so that the most unstable mode has a real eigenvalue. The higher modes occur in complex conjugate pairs.

For a given value of p, one can draw the eigenvalue loci in the x+iy plane. It is found that when p < exp(-1), two contour curves exist in the x-y plane, while if p > exp(-1), only one curve exists. The associated wavenumbers k are represented as points (x, y) on the curves, corresponding to the intersections in figure 5(a). If we are in the region where p > exp(-1), the contour of p will be to the right of the set of half circles near the origin. An infinite set of discrete pairs (x, y) (and thus wavenumbers k) is found along each contour in that set. If p < exp(-1), two curves exist, a half circle and a line somewhere to the left of the half circle. Only one pair $(x_0, y_0)$ exists on the associated half circle, representing the one localized mode. The second curve will have an infinite but discrete set of (x, y) pairs along it, representing the delocalized spectrum. When p = exp(-1), we are right at the delocalization transition. This corresponds to the eigenvalue s moving onto the apex of the parabola in figure 4(b). Thus, the value of the non-dimensional parameter p determines at what velocity the delocalization transition occurs, for a given $\beta$, D, L.
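
The quantities of this section are easy to evaluate numerically. The sketch below assumes $p = \eta e^{-P\eta}$ with $\eta = uL/2D$ and $P = \beta/u - 1$, and solves $x_0e^{x_0} = p$ by Newton's method (the root is the Lambert W function of p). The function names are hypothetical.

```python
import math


def p_parameter(beta, u, D, L):
    """p = eta * exp(-P * eta), with eta = uL/(2D) and P = beta/u - 1."""
    eta = u * L / (2.0 * D)
    P = beta / u - 1.0
    return eta * math.exp(-P * eta)


def is_localized_finite_ring(beta, u, D, L):
    """Large-ring criterion: the localized mode survives while p < exp(-1)."""
    return p_parameter(beta, u, D, L) < math.exp(-1.0)


def most_unstable_x0(p, tol=1e-14, max_iter=100):
    """Newton solve of x0 * exp(x0) = p for the real root x0 >= 0 (p >= 0)."""
    x = math.log1p(p)           # starting guess at or above the root
    for _ in range(max_iter):
        ex = math.exp(x)
        step = (x * ex - p) / (ex * (1.0 + x))
        x -= step
        if abs(step) < tol:
            break
    return x
```

With $\beta = 2$, u = 1, D = 1, L = 2 one gets $\eta = 1$, P = 1 and p = exp(-1), so this choice sits exactly on the transition.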

A restriction on the velocity, similar to $|u| < \beta$ for the infinite ring, comes in the form of the transcendental inequality

$$p = \eta e^{-P\eta} < \exp(-1) . \qquad (41)$$

4 Weakly non-linear analysis near
the delocalization transition

The goal of this work is to study the effect of the non-linear saturation term on the delocalization transition that we observed in the linear analysis. In the following sections we explore this effect in infinite and finite diameter rings.

The previous section reviewed the results of DNS and carefully studied the existence of the delocalization transition in our one-dimensional model. While DNS briefly discuss the effects of the non-linear term which represents the competition between the bacteria, they do not study it in depth. They suggest that the non-linear term is irrelevant, especially in the limit that the ring has an infinite diameter.

Here we consider the effect of the non-linear saturation term, $-bc^2$, on the delocalization transition and discuss its effects on the dynamics of our bacteria colony. The modified-Fisher equation is now examined in its entirety:

$$c_t + uc_x = Dc_{xx} + [\beta\delta(x) - a]c - bc^2 . \qquad (42)$$

4.1  Infinite  ring  on  a  windy  day 

We start by studying the infinite ring. We remain near the delocalization transition represented by the lines in figure (1), by keeping $\beta$ nearly equal to the critical value $\beta_c = u$,

$$\beta = \beta_c(1 + \epsilon) , \qquad (43)$$

where $\epsilon$ is a small parameter. We also define a slow time T and express the concentration as

$$T = \epsilon t , \qquad c = \epsilon f .$$

Rewritten with these scalings, the adjusted equation is

$$\epsilon f_T + uf_x = Df_{xx} + \beta_c(1 + \epsilon)\delta(x)f - af - b\epsilon f^2 . \qquad (44)$$

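Expanding f in powers of $\epsilon$,

$$f = f_0 + \epsilon f_1 + \epsilon^2f_2 + \cdots ,$$

and substituting the expansion of f gives

$$\epsilon(f_{0T} + \epsilon f_{1T}) + u(f_{0x} + \epsilon f_{1x}) = D(f_{0xx} + \epsilon f_{1xx}) + \qquad (45)$$

$$\beta_c(1 + \epsilon)\delta(x)(f_0 + \epsilon f_1) - a(f_0 + \epsilon f_1) - b\epsilon(f_0 + \epsilon f_1)^2 . \qquad (46)$$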

The zeroth order equation in $\epsilon$ is

$$Df_{0xx} - uf_{0x} + \beta_c\delta(x)f_0 - af_0 = 0 . \qquad (47)$$

It is useful to write (47) in terms of the linear operator $\mathcal{L}$ (7) defined previously in section 3.2, so that

$$\mathcal{L}f_0 = 0 . \qquad (48)$$

The solution $f_0$ of this zeroth order homogeneous equation is

$$f_0 = A(T)\,e^{k_\pm x_\pm} , \qquad (49)$$

where A(T) is the time-dependent amplitude of the solution. This is the result we expect, since the zeroth order solution represents the eigenfunction at the delocalization transition. The constant concentration along x is what we would see when $\beta_c = u$ in figure (1). For now, we imagine that $\beta$ is not quite equal to u. The first-order equation in $\epsilon$ is

$$f_{0T} + uf_{1x} = Df_{1xx} + \beta_c\delta(x)f_1 + \beta_c\delta(x)f_0 - af_1 - bf_0^2 . \qquad (50)$$

Rewriting (50) in terms of the linear operator $\mathcal{L}$,

$$\mathcal{L}f_1 = f_{0T} - \beta_c\delta(x)f_0 + bf_0^2 . \qquad (51)$$

The first order eigenfunctions may be solved for from (51), using the definition of $f_0$ found in (49).

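To find the time-dependence of the amplitude A(T) of $f_0$, we derive an amplitude equation. Using the adjoint of $f_0$, $f_0^\dagger$, and integration by parts, it is easy to show that

$$\langle f_0^\dagger\mathcal{L}f_1\rangle = \langle f_1\mathcal{L}^\dagger f_0^\dagger\rangle = 0 . \qquad (52)$$

We may take advantage of this fact if we multiply (51) by $f_0^\dagger$ and integrate over x. The left hand side disappears and the right hand side reduces to the amplitude equation

$$\langle f_0^\dagger\mathcal{L}f_1\rangle = \langle f_1\mathcal{L}^\dagger f_0^\dagger\rangle = 0 = \langle f_0^\dagger f_{0T}\rangle + b\langle f_0^\dagger f_0^2\rangle - \beta_c\langle f_0^\dagger f_0\,\delta(x)\rangle . \qquad (53)$$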

Using  our  solution  for  the  zeroth-order  amplitude  fo  and  performing  the  integrals  results  in 
the  amplitude  equation: 


This result is not unusual for a perturbative analysis of a non-linear problem, and is not of much interest except for comparison with the unique results of the next section. It is reassuring to note that if $\beta_c = u$, this amplitude equation blows up, as it should since none of the integrals we performed would have converged. The solution to (54) is found easily using the Bernoulli trick, which transforms non-linear equations into solvable linear equations. As a check on this result, one may show that the analytical and perturbative energies agree at the zeroth and first orders.
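
The Bernoulli trick can be illustrated on a generic Landau-type amplitude equation $A_T = \mu A - \nu A^2$. The coefficients $\mu$ and $\nu$ below are hypothetical placeholders, not the coefficients of (54); the sketch only shows how the substitution $B = 1/A$ linearizes the problem, and compares the closed form with a direct Runge-Kutta integration.

```python
import math


def bernoulli_solution(A0, mu, nu, T):
    """Closed-form solution of A_T = mu*A - nu*A**2 with A(0) = A0.

    With B = 1/A the equation becomes the linear problem B_T = -mu*B + nu.
    """
    B0 = 1.0 / A0
    B = nu / mu + (B0 - nu / mu) * math.exp(-mu * T)
    return 1.0 / B


def rk4_solution(A0, mu, nu, T, steps=2000):
    """Fourth-order Runge-Kutta integration of the same equation, as a check."""
    def rhs(A):
        return mu * A - nu * A * A

    h, A = T / steps, A0
    for _ in range(steps):
        k1 = rhs(A)
        k2 = rhs(A + 0.5 * h * k1)
        k3 = rhs(A + 0.5 * h * k2)
        k4 = rhs(A + h * k3)
        A += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return A
```

For positive $\mu$ and $\nu$ the amplitude saturates at $\mu/\nu$ at late times.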
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4.2  Finite  ring  on  a  windy  day:  Part  I 
The  solution 

In this section we work with the entire physical model. At the origin there is an oasis where the bacteria grow; away from the origin the bacteria struggle for life, dying at a rate a. The bacteria diffuse along concentration gradients and are advected by a constant background flow at speed u. Throughout the ring, the bacteria compete, dying if conditions become too crowded, thus adding to the resultant death rate of the desert. Finally, the bacteria live in a finite domain with periodic boundary conditions, i.e. a ring.

The strategy here is to expand near the delocalization threshold, as we have done for the
infinite  domain.  It  is  also  assumed  that  the  ring  is  large,  but  finite.  Obviously,  this  is  not  the 
most  general  consideration  of  the  problem  that  can  be  made,  but  it  allows  one  to  proceed 
analytically.  The  meaning  of  a  large  domain  will  be  discussed  below. 

We  begin  by  non-dimensionalizing  the  modified-Fisher  equation  with  the  intention  of 
having  a  firm  grasp  of  the  size  of  each  term.  Non-dimensionalizing  may  obscure  the  physics, 
but  it  clarifies  the  relative  magnitudes  of  terms.  It  is  natural  to  scale  distance  with  the  length 
of  the  ring  and  to  scale  time  with  the  transit  time: 

$$\hat{x} = \frac{x}{L} , \qquad \hat{t} = \frac{tu}{L} , \qquad \delta(x) = \delta(L\hat{x}) = \frac{1}{L}\delta(\hat{x}) . \qquad (55)$$

The  resulting  equation  is: 

$$c_{\hat t} + c_{\hat x} = \frac{L}{u}\left(\frac{1}{L}\beta\delta(\hat x) - a\right)c + \frac{D}{Lu}c_{\hat x\hat x} - \frac{Lb}{u}c^2 . \qquad (56)$$

Several  more  useful  non-dimensionalizations  are: 

$$\tilde\alpha = \frac{L}{u}a , \qquad \hat\beta = \frac{1}{u}\beta , \qquad \hat D = \frac{D}{Lu} , \qquad \hat c = \frac{b}{a}c . \qquad (57)$$

These non-dimensionalized constants contain the physical meaning of competing effects. $\tilde\alpha$ is the ratio of the advective transit time L/u to the decay time 1/a. $\hat D$ is the inverse Peclet number, or a measure of the strength of diffusion versus advection. Finally, $\hat\beta$ is a measure of the nearness to the delocalization transition. If $|\hat\beta| = 1$, we are at the transition.

The fully non-dimensionalized equation is

$$\hat c_{\hat t} + \hat c_{\hat x} = \hat D\hat c_{\hat x\hat x} + [\hat\beta\delta(\hat x) - \tilde\alpha]\hat c - \tilde\alpha\hat c^2 . \qquad (58)$$

We are interested in the non-linearity near the delocalization transition, and so we expand $\hat\beta$ around 1. Using the non-dimensionalized diffusion coefficient $\hat D$ as the small parameter with which we expand, we have

$$\hat\beta = 1 + \hat D\beta_1 . \qquad (59)$$

When $\hat\beta > 1$ there is a localized mode, and this mode becomes delocalized if $\hat\beta < 1$. Thus, condition (59), in which $\beta_1$ is held fixed as $\hat D \to 0$, ensures that the system is operating close to this delocalization threshold. $\beta_1$ may be positive or negative, putting us in either the localized or delocalized regime, as long as $\hat D$ is small. Small $\hat D$ is equivalent to large L or large u since $\hat D = D/Lu$ (57).

Figure 6: A schematic illustration of the boundary layer structure of the solution when $\epsilon \ll 1$, showing the outer and inner eigenfunctions. $\epsilon = D$


4.2.1  The  outer  solution 

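Dropping the tildes and hats for notational simplicity, and making an expansion in the small parameter D, $c \sim c_0 + Dc_1$, (58) becomes, with $\epsilon = D$:

$$(c_{0t} + \epsilon c_{1t}) + (c_{0x} + \epsilon c_{1x}) = (\beta\delta(x) - \alpha)(c_0 + \epsilon c_1) \qquad (60)$$

$$+\, D(c_{0xx} + \epsilon c_{1xx}) - \alpha(c_0^2 + 2\epsilon c_0c_1) . \qquad (61)$$

The zeroth order equation is then

$$c_{0t} + c_{0x} = -\alpha c_0 - \alpha c_0^2 . \qquad (62)$$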


A series of tricks and substitutions are used to solve for the zeroth order concentration $c_0(x,t)$. The result is

$$c_0(x,t) = \frac{f(t - x)}{(e^{\alpha x} - 1)f(t - x) + e^{\alpha x}} . \qquad (63)$$

Checking this expression at x = 0 reveals c(0,t) = f(t). This implies that all higher-order terms must be zero at the origin. Meanwhile, attempting to demonstrate the periodicity of the system at x = 1,

$$f(t) = \frac{f(t - 1)}{(e^{\alpha} - 1)f(t - 1) + e^{\alpha}} , \qquad (64)$$

requires a stringent restriction of f(t - x), suggesting that there is a problem at x = 1.
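
The outer solution can be checked numerically without retracing the substitutions. The sketch below assumes the form $c_0 = f(t-x)/[(e^{\alpha x} - 1)f(t-x) + e^{\alpha x}]$ and the zeroth order equation $c_{0t} + c_{0x} = -\alpha c_0 - \alpha c_0^2$; the test amplitude `f` and the value of `ALPHA` are arbitrary illustrative choices, not taken from the text.

```python
import math

ALPHA = 2.0  # illustrative value of the non-dimensional death rate


def f(t):
    """Arbitrary smooth, positive test amplitude (hypothetical)."""
    return 1.0 + 0.5 * math.sin(t)


def c0(x, t, alpha=ALPHA):
    """Assumed outer solution, built from f evaluated at the retarded time t - x."""
    e = math.exp(alpha * x)
    return f(t - x) / ((e - 1.0) * f(t - x) + e)


def residual(x, t, alpha=ALPHA, h=1e-5):
    """Central-difference residual of c_t + c_x + alpha*c + alpha*c**2."""
    c_t = (c0(x, t + h, alpha) - c0(x, t - h, alpha)) / (2.0 * h)
    c_x = (c0(x + h, t, alpha) - c0(x - h, t, alpha)) / (2.0 * h)
    c = c0(x, t, alpha)
    return c_t + c_x + alpha * c + alpha * c * c
```

The residual vanishes to finite-difference accuracy in the interior, while `c0(1.0, t)` is far from `f(t)`, which is the periodicity failure described above.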

In fact, this solution is an “outer approximation”, which is valid provided that 0 < x < 1 - O(D). The failure of the outer solution at the boundary becomes apparent if we define the restricted parameter $\lambda$,

$$\lambda = \frac{e^{-\alpha}}{D} , \qquad \text{or,} \qquad \alpha = \log\frac{1}{\lambda D} , \qquad (65)$$

where $\lambda$ is fixed as D → 0. This means that $\alpha$ is large, but not very large. The parameter $\lambda$ is necessary for a satisfactory asymptotic development.

In the sequel we will treat $\alpha$ as $O(D^0)$, except when it appears in exponentials, where $e^{-\alpha}$ is $O(D^1)$. This condition means that the population which is swept away from the oasis will decay to O(D) on its passage through the desert before revisiting the oasis.
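
To get a feeling for the sizes involved, the dimensional inputs can be mapped to the non-dimensional parameters of this section. The sketch below assumes the definitions $\alpha = aL/u$, $\hat\beta = \beta/u$, $\hat D = D/(Lu)$, $\beta_1 = (\hat\beta - 1)/\hat D$ and $\lambda = e^{-\alpha}/\hat D$; the function name and the sample values are hypothetical.

```python
import math


def nondimensional_parameters(a, beta, D, u, L):
    """Map dimensional (a, beta, D, u, L) to the assumed non-dimensional set."""
    alpha = a * L / u                    # transit time over decay time
    beta_hat = beta / u                  # nearness to the delocalization transition
    D_hat = D / (L * u)                  # inverse Peclet number (small parameter)
    beta_1 = (beta_hat - 1.0) / D_hat    # from beta_hat = 1 + D_hat * beta_1
    lam = math.exp(-alpha) / D_hat       # restricted parameter lambda
    return {"alpha": alpha, "beta_hat": beta_hat, "D_hat": D_hat,
            "beta_1": beta_1, "lam": lam}
```

The asymptotic ordering requires `lam` and `beta_1` to stay of order one while `D_hat` is small.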

In the limit D → 0, with $\beta_1$ and $\lambda$ fixed, all details of the solution can be expressed in terms of f(t - x) and simple functions of x. The form of the solution is indicated in figure (6): there is a boundary layer of thickness D immediately to the left of x = 1.

Rewriting the zeroth order concentration $c_0$ in terms of $\lambda$ and expanding in terms of the small parameter D, we see why the outer approximation does not satisfy the periodicity requirement. At x = 1, $c_0(x = 1, t)$ is

$$c_0(x = 1, t) = \frac{D\lambda f(t - 1)}{1 + f(t - 1) - D\lambda f(t - 1)} . \qquad (66)$$

Expanding gives

$$c_0(x = 1, t) \approx \frac{D\lambda f(t - 1)}{1 + f(t - 1)} + \frac{D^2\lambda^2f(t - 1)^2}{[1 + f(t - 1)]^2} + \cdots \neq f(t) . \qquad (67)$$

Near x = 1, this outer solution for the concentration of bacteria, $c_0$, has decayed to O(D) and thus is inappropriate to describe this region of the ring. The role of the boundary layer at x = 1 is to repair this failure, and so to determine the evolution of f(t). This insight is the most difficult part of this asymptotic expansion.


4.2.2  The  boundary  layer 

We now turn to the “inner region”, and introduce the stretched coordinate, $\xi = x/D$. In terms of $\xi$, equation (58) becomes

$$Dc_t + c_\xi = c_{\xi\xi} + D[\beta\delta(D\xi) - \alpha]c - D\alpha c^2 . \qquad (68)$$

Making an expansion in D of the concentration, $c(\xi,t) \approx c_0 + Dc_1$, we arrive at an equation to solve for the zeroth order boundary layer concentration, $c_0(\xi,t)$:

$$c_{0\xi\xi} - c_{0\xi} = 0 . \qquad (69)$$

The solution is

$$c_0 = fe^{\xi} . \qquad (70)$$

This solution satisfies the requirement that at $\xi = 0$ we have c(0,t) = f(t), implying that all higher order terms are zero at the origin.

The first order boundary layer equation is

$$c_{1\xi} - c_{1\xi\xi} = -\alpha c_0 - \alpha c_0^2 - c_{0t} . \qquad (71)$$

Solving this inhomogeneous partial differential equation with the help of the solution $c_0(\xi,t)$ in (70) gives the expression for the first order boundary layer concentration $c_1(\xi,t)$:

$$c_1(\xi,t) = (f_t + \alpha f)\xi e^{\xi} + r(t)(1 - e^{\xi}) + \tfrac{1}{2}\alpha f^2(e^{2\xi} - e^{\xi}) , \qquad (72)$$

where r(t) is the constant of integration.

The constant r(t) is found by matching the outer concentration with the boundary layer in the limit that $\xi \to -\infty$, which is equivalent to taking the limit where the boundary layer disappears, D → 0. The zeroth order boundary solution $c_0$ is zero, and all the terms but r(t) are zero in the first order boundary layer in this limit. For the matching, the outer solution is evaluated at x = 1, which is appropriate in this limit, since there is no boundary current for an infinite domain. This is an example of a “switchback”: the $O(D^1)$ inner solution matches the leading order outer solution.

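The resultant expression for r(t) follows from the matching condition

$$c_0(\xi = -\infty, t) + Dc_1(\xi = -\infty, t) = c_0(x = 1, t) ,$$

which gives

$$r(t) = \frac{\lambda f(t - 1)}{1 + f(t - 1)} . \qquad (73)$$

The first order expression for the boundary layer concentration is then

$$c_1(\xi,t) = (f_t + \alpha f)\xi e^{\xi} + \frac{\lambda f(t - 1)}{1 + f(t - 1)}(1 - e^{\xi}) + \tfrac{1}{2}\alpha f^2(e^{2\xi} - e^{\xi}) . \qquad (74)$$

Thus r(t) carries a time delay: it depends on f at the earlier time t - 1. The origin of this term is interesting to note, as it will be the origin of the rest of our discussion.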

As usual, an element of information has been neglected by excluding evaluation of the modified-Fisher equation at the origin. The patching condition contains this information, and can now be evaluated since we have expressions for the outer and inner solutions. This condition, obtained by integrating (68) about the origin, is

$$c_\xi(-) - c_\xi(+) = (1 + D\beta_1)c(0) , \qquad (75)$$

where the minuses and pluses indicate evaluation to the left and right of the origin, and thus imply whether to use the inner or outer solution. For instance, $c_\xi(+)$ is actually the outer solution to the right of the origin, $Dc_x(x = 0, t)$. Expanding in D gives

$$c_{0\xi} + Dc_{1\xi} - Dc_{0x} - D^2c_{1x} = (1 + D\beta_1)[c_0(x = 0) + Dc_1(x = 0)] . \qquad (76)$$

It is interesting to note that the zeroth order outer solution is related non-trivially to the first order boundary solution because of the role of the diffusion coefficient D as a small parameter.

The zeroth order equation is trivially satisfied due to our choice of the constant in the solution (70). The non-trivial first order expression is

$$c_{0x}(x = 0, t) - c_{1\xi}(\xi = 0, t) + \beta_1c_0(x = 0, t) = 0 . \qquad (77)$$

Differentiating the appropriate expressions for the concentration and evaluating them at x = 0 ($\xi = 0$) in (77) leads to a very interesting amplitude equation for f(t),

$$f_t = \left(\frac{\beta_1}{2} - \alpha\right)f - \frac{3}{4}\alpha f^2 + \frac{\lambda}{2}\,\frac{f(t - 1)}{1 + f(t - 1)} . \qquad (78)$$

This amplitude equation is a differential-delay equation. The rate of change of f at any time depends not only on the value of f at that particular moment, but also on the value of f at a specific earlier time, t - 1. Not surprisingly, the time-delay is 1 time-unit, or the time required for transit around the ring. Comparison to the amplitude equation for the infinite ring reveals that we have the same Ginzburg-Landau type terms, while the time-delay piece is a result of the finite size of the ring. We note that $\lambda$ is the only parameter that depends on the length of the ring L. As the ring becomes infinite in size, $\lambda \to 0$, so that the differential-delay equation reduces to the Ginzburg-Landau equation of the previous section with redimensionalization.

A good check is to verify that the first order energy obtained from (78) agrees with the analytic expression for the energy of the linear solution. The linearized version of (78) is

$$f_t = \left(\frac{\beta_1}{2} - \alpha\right)f + \frac{\lambda}{2}f(t - 1) . \qquad (79)$$

Consistent with linearization, we assume this is an eigenvalue problem and let $f = e^{st}$. The expression for the first order energy is then

$$s = \frac{\beta_1}{2} - \alpha + \frac{e^{-\alpha - s}}{2D} . \qquad (80)$$
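
Because the growth rate appears on both sides, the first order energy has to be found numerically. The sketch below assumes the relation in the form $s = \beta_1/2 - \alpha + (\lambda/2)e^{-s}$, with $\lambda = e^{-\alpha}/D$, and finds its real root by bisection; the function name is hypothetical.

```python
import math


def linear_growth_rate(beta_1, alpha, lam, tol=1e-13):
    """Real root s of s = beta_1/2 - alpha + (lam/2)*exp(-s), for lam >= 0.

    h(s) = s - b - (lam/2)*exp(-s) increases monotonically in s,
    so the real root is unique and bisection is safe.
    """
    b = 0.5 * beta_1 - alpha

    def h(s):
        return s - b - 0.5 * lam * math.exp(-s)

    lo, hi = b, b + 0.5 * lam * math.exp(-b)   # h(lo) <= 0 <= h(hi)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if h(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

With $\lambda = 0$ the delay term drops out and the function returns $\beta_1/2 - \alpha$; any positive $\lambda$ raises the growth rate.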


Remembering that we have non-dimensionalized our results, we work the analytic result, (23, 24), into the same form. Non-dimensionalization leads to

$$2Dk\left(\cosh k - \cosh\frac{1}{2D}\right) = (1 + D\beta_1)\sinh k , \qquad (81)$$

$$k = \frac{1}{2D}\sqrt{1 + 4D(\alpha + s)} . \qquad (82)$$

To show the equivalence, we expand k in D and drop any terms which are exponentially small in 1/D:

$$k \approx \frac{1}{2D} + \alpha + s . \qquad (83)$$

Using this in the transcendental relation for k (81) leaves us with the expression

$$-\frac{1}{2} + (\alpha D + sD)(e^{\alpha + s} - 1) = D\frac{\beta_1}{2}e^{\alpha + s} . \qquad (84)$$
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Figure  7:  Periodic  and  chaotic  behavior  of  the  Glass-Mackey  equation  in  various  regimes  (a) 
quasi-periodic  state  when  m=8  (b)  phase  diagram,  m=8  (c)  chaotic  state  when  m=10  (d) 
phase  diagram,  m=10 


Again, we consider which terms are very small. Much smaller than any of the exponential terms, sD is dropped, as is $\alpha D$, since $\alpha \sim O(-\ln D)$. Thus, we are left with an expression identical to the first order energy, if we rearrange the following:

$$-\frac{1}{2} + (\alpha + s)De^{\alpha + s} = D\frac{\beta_1}{2}e^{\alpha + s} . \qquad (85)$$

4.3  Finite  Ring  on  a  Windy  Day:  Part  II 
The  Dynamics 

The  next  step  of  this  analysis  is  obvious:  we  should  study  the  stability  of  the  steady  state 
solutions  of  our  amplitude  equation.  Before  beginning  this  analysis,  we  set  the  tone  of  the 
rest  of  this  project  by  suggesting  the  results  that  were  expected. 


4.3.1  Interesting  Zoology  of  Differential-Delay  Equations 

Differential-delay equations are well known for their periodic, quasi-periodic, or chaotic behaviour, with examples often arising in biology. One such system is the model suggested by Glass and Mackey [6] to describe the regulation of white blood cells. The structure of the Glass-Mackey equation is similar to our differential-delay equation,


$$c_t = \frac{\lambda c(t - T)}{1 + c^m(t - T)} - \gamma c , \qquad (86)$$

and is nearly identical to ours, if m = 1.

This differential-delay equation describes the change in time of the concentration of the white blood cells, c. The rate at which cells die is proportional to c, i.e., $-\gamma c$. Meanwhile, the flux, $\lambda$, of new cells produced by bone marrow is dependent on the concentration of the blood cells at some previous set time, t - T, due to a delay time T in the production of white blood cells. This time delay exists because of the time costs of communication and production. All the parameters, $\lambda$, $\gamma$, m, T are greater than zero. m is a parameter determined experimentally, and if large enough, gives rise to limit cycles or chaos. See figure (7).
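
A delay equation of this type can be integrated with a very simple scheme. The sketch below uses forward Euler with a constant initial history on [-T, 0]; both choices, the function name, and the parameter values used in the checks are illustrative assumptions and are not the settings behind figure 7.

```python
def integrate_glass_mackey(lam, gamma, m, T, c_init, t_end, dt=0.01):
    """Forward-Euler integration of c_t = lam*c(t-T)/(1 + c(t-T)**m) - gamma*c.

    The history on [-T, 0] is constant and equal to c_init.
    Returns the list of values c(0), c(dt), ..., c(t_end).
    """
    lag = int(round(T / dt))            # delay measured in time steps
    n_steps = int(round(t_end / dt))
    c = [c_init] * (lag + 1)            # c[-1] is always the current value
    for _ in range(n_steps):
        delayed = c[-1 - lag]           # value one delay time in the past
        rhs = lam * delayed / (1.0 + delayed**m) - gamma * c[-1]
        c.append(c[-1] + dt * rhs)
    return c[lag:]
```

For m = 1 the solution relaxes to the steady state $\lambda/\gamma - 1$; for larger m the same routine can be used to explore oscillatory behaviour, with a smaller `dt` advisable for quantitative work.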
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In  our  system,  we  deal  with  our  first  exposure  to  a  delay-equation  and  thus,  a  potentially 
chaotic system. The possibility of chaotic dynamics of the concentration of bacteria seems
like  a  fascinating  result.  We  examine  this  possibility  below. 

4.3.2  What  about  us?  A  stability  analysis 

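Thus, one might begin the stability analysis of the bacteria-ring system. Perhaps the three free parameters of our system, $\lambda$, $\beta$, and $\alpha$, can have both reasonable physical values and interesting dynamics. Simplifying the differential-delay equation (78) with the definitions

$$\bar\beta = \frac{\beta_1}{2} - \alpha , \qquad \bar\alpha = \frac{3}{4}\alpha , \qquad \bar\lambda = \frac{1}{2}\lambda , \qquad (87)$$

the equation becomes

$$f_t = \bar\beta f - \bar\alpha f^2 + \bar\lambda\,\frac{f(t - 1)}{1 + f(t - 1)} . \qquad (88)$$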

The steady-states of (88) are found by letting $f_t = 0$, where $f(t-1) = f(t)$. This leads to a cubic equation for the roots, one of which is zero, and the other two roots are obtained from

$$\alpha f^2 + (\alpha - \beta) f - (\lambda + \beta) = 0 . \qquad (89)$$

The roots are

$$f_\pm = \frac{(\beta - \alpha) \pm \sqrt{(\alpha + \beta)^2 + 4\alpha\lambda}}{2\alpha} . \qquad (90)$$
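As a quick check of the roots, the short sketch below evaluates $f_\pm$ and substitutes them back into the steady-state balance $\beta f - \alpha f^2 + \lambda f/(1+f) = 0$. The function names and the sample parameter values are illustrative assumptions.

```python
import math


def steady_states(alpha, beta, lam):
    """Non-zero steady states f_plus, f_minus from the quadratic (89)."""
    root = math.sqrt((alpha + beta) ** 2 + 4.0 * alpha * lam)
    f_plus = ((beta - alpha) + root) / (2.0 * alpha)
    f_minus = ((beta - alpha) - root) / (2.0 * alpha)
    return f_plus, f_minus


def steady_residual(f, alpha, beta, lam):
    """Right-hand side of the delay equation when f(t-1) = f(t) = f."""
    return beta * f - alpha * f * f + lam * f / (1.0 + f)
```

For positive $\alpha$, $\beta$, $\lambda$ the product of the two roots is $-(\lambda+\beta)/\alpha < 0$, so exactly one of them is positive.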


Discarding the non-physical negative amplitude, we are left with two steady-states which the system may tend towards, $f = 0$ and $f_0 = f_+$. A study of the stability is necessary to understand the dynamics. Expanding around the steady-state solutions, we use $f = f_0 + \epsilon f_1$. The delay term in (88) must also be expanded in terms of $\epsilon$. We find that the zeroth order equation for $f_0$ is just the cubic equation obtained earlier. The first order differential delay-equation defining $f_1$ is:

$$f_{1t} = \beta f_1 - 2\alpha f_0 f_1 + \frac{\lambda f_1(t-1)}{(1 + f_0)^2} . \qquad (91)$$


It is useful to define a function $N$,

$$N = \frac{1}{1 + f_0} , \qquad (92)$$

$$N' = -\frac{1}{(1 + f_0)^2} , \qquad (93)$$

to rewrite (91), where $N' = dN/df_0$. Since this is a linear stability analysis, $f_1 \propto e^{st}$, which gives

$$s = \beta - 2\alpha f_0 + \lambda e^{-s}\,(N + f_0 N') . \qquad (94)$$




Figure 8: (a) Various steady-state solutions found numerically using ARCHI. Parameters are given with each curve. (b) Plot of $N f_0$ vs $f_0$ for the modified-Fisher equation and the Glass-Mackey equation. The leveling off of the modified-Fisher equation prevents any instability. The negative slope in the Glass-Mackey equation is what gives it the ability to be unstable. The higher the value of the Hill coefficient, $m$, the steeper the slope.


To study the growth rate let $s = \mu + i\omega$. (94) becomes

$$\mu + i\omega - \beta + 2\alpha f_0 = \lambda e^{-\mu - i\omega}\,(N + f_0 N') . \qquad (95)$$

Direct instabilities occur when $\mu$, the real part of $s$, is greater than zero. For simplicity, let $\omega = 0$ and examine if it is possible for $\mu > 0$. We can show that $\beta = \alpha f_0 - \lambda N$, using the zeroth order equation, (89), so that

$$\beta - 2\alpha f_0 = -\alpha f_0 - \lambda N . \qquad (96)$$

Using (96) we arrive at:

$$\mu = -\alpha f_0 - \lambda N (1 - e^{-\mu}) + \lambda e^{-\mu} f_0 N' . \qquad (97)$$

Assuming $\mu > 0$, we find the expression on the right hand side to be negative for all parameters (recall that $N' < 0$). This disproves our assumption. There can be no direct instability.

Hopf instabilities are the second possible type of linear instability. Breaking (95) up into its real and imaginary parts, squaring and combining them gives

$$(\mu - \beta + 2\alpha f_0)^2 + \omega^2 = \lambda^2 e^{-2\mu}\,(N + f_0 N')^2 . \qquad (98)$$

For simplicity, assume $\mu = 0$, and subtract $\omega^2$ from each side. Using (96), we have the inequality

$$\left(\frac{\alpha f_0}{\lambda} + N\right)^2 < (N + f_0 N')^2 . \qquad (99)$$
If (99) is true, an instability exists. Further simplification leads to

$$|f_0 N'| > 2N + \frac{\alpha f_0}{\lambda} , \qquad \text{that is,} \qquad -\lambda(2 + f_0) - \alpha f_0 (1 + f_0)^2 > 0 . \qquad (100)$$

This condition is obviously impossible since $f_0$, $\alpha$, $\lambda$ are all positive. Thus, there are no Hopf instabilities. This system is stable, an acceptable but disappointing result for a situation represented by an equation filled with such apparent possibility. We check these results against numerical results, exploring some of the 3-dimensional parameter space. The two agree, as shown in figure (8(a)), which plots several numerical results. One may note that the bumpiness in the amplitude corresponds with the time delay, so that at each time unit, the slope increases until the steady-state value is reached.
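The absence of instabilities can also be probed directly by time-stepping the delay equation. The sketch below is not the ARCHI computation mentioned in the text; it is a plain forward-Euler integration of $f_t = \beta f - \alpha f^2 + \lambda f(t-1)/(1 + f(t-1))$ with a constant initial history, and the function names and parameter triples are assumptions made for illustration.

```python
import math
from collections import deque


def positive_steady_state(alpha, beta, lam):
    """The physically relevant root f_plus of the steady-state quadratic."""
    root = math.sqrt((alpha + beta) ** 2 + 4.0 * alpha * lam)
    return ((beta - alpha) + root) / (2.0 * alpha)


def simulate_ring(alpha, beta, lam, f_init=0.1, dt=0.001, t_end=60.0):
    """Euler integration with unit delay; returns the last delay window."""
    delay_steps = int(round(1.0 / dt))
    # history[0] is f(t - 1), history[-1] is f(t); constant initial history
    history = deque([f_init] * (delay_steps + 1), maxlen=delay_steps + 1)
    for _ in range(int(round(t_end / dt))):
        f_now, f_delayed = history[-1], history[0]
        growth = beta * f_now - alpha * f_now ** 2
        feedback = lam * f_delayed / (1.0 + f_delayed)
        history.append(f_now + dt * (growth + feedback))
    return list(history)
```

For each parameter triple tried, the final window should be flat and equal to $f_+$, in line with the conclusion that no oscillatory state survives.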


4.3.3  Key  to  Instability:  A  comparison 


In hindsight, it is easy to predict that our system will be stable throughout parameter space, despite the freedom of three independent parameters, $\lambda$, $\beta$, and $\alpha$. The hint is contained in the Glass-Mackey equation (86). Rewriting it in terms of a similar $N$, the $N_{GM}$ of the Glass-Mackey equation is

$$N_{GM}(f_0) = \frac{1}{1 + f_0^m} , \qquad (101)$$

$$N'_{GM}(f_0) = -\frac{m f_0^{m-1}}{(1 + f_0^m)^2} . \qquad (102)$$

A similar stability analysis shows that direct instabilities are always impossible, while Hopf instabilities may exist if

$$|f_0 N'| > 2N . \qquad (103)$$

This inequality is very similar to the bug-ring system's inequality for Hopf bifurcations. The difference is due to the non-linear term in our amplitude equation (78). One might think that the non-linear term is what is preventing instability, that it is damping out oscillations, but comparison of these two inequalities reveals that the non-linear term only makes what is already impossible more impossible. The stability is a result of the structure of $N$ and $N_{GM}$. Plot (8)(b) shows $N f_0$ versus $f_0$, where the slope is $\lambda_1 = N + f_0 N'$. For our system, the slope is always positive so that

$$\lambda_1 = N + f_0 N' > 0 . \qquad (104)$$

For Glass-Mackey,

$$\lambda_2 = N_{GM} + f_0 N'_{GM} < 0 . \qquad (105)$$

We see from (100) that for a Hopf instability to exist,

$$0 > \lambda_1 + N + \frac{\alpha f_0}{\lambda} . \qquad (106)$$

This is impossible unless the slope $\lambda_1 < 0$, since the other two terms are positive. And so we see that the structure of $N$ is crucial to the stability of these types of systems.
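The contrast between the two feedback curves can be made concrete with a few lines of code. The sketch assumes $N = 1/(1+f_0)$ for the ring system and $N_{GM} = 1/(1+f_0^m)$ for Glass-Mackey, and evaluates the slope of $f_0 N$ in each case; the function names are illustrative.

```python
def slope_modified_fisher(f0):
    """d/df [f/(1+f)] = N + f0*N' = 1/(1+f0)**2, positive for every f0 >= 0."""
    return 1.0 / (1.0 + f0) ** 2


def slope_glass_mackey(f0, m):
    """d/df [f/(1+f**m)] = (1 + (1-m)*f0**m) / (1+f0**m)**2."""
    fm = f0 ** m
    return (1.0 + (1.0 - m) * fm) / (1.0 + fm) ** 2


def hopf_possible_glass_mackey(f0, m):
    """Condition |f0*N'| > 2*N, which reduces to m*f0**m/(1+f0**m) > 2."""
    fm = f0 ** m
    return m * fm / (1.0 + fm) > 2.0
```

Since $f_0^m/(1+f_0^m) < 1$, the Hopf condition can only be met for $m > 2$, which is consistent with the statement that $m$ must be large enough for limit cycles or chaos.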
4.4 Other possibilities for instability

With  the  initial  goals  of  this  work  accomplished,  it  is  fun  to  continue  along  further  lines  of 
investigation.  For  instance,  is  it  possible  to  modify  the  non-linear  saturation  term  so  that 
we  may  find  a  non-steady-state  solution?  Or  what  about  increasing  the  spatial  complexity? 
Or  perhaps  adding  another  delta  function  to  the  ring  or  many  more,  will  lead  to  chaotic 
dynamics.  What  if  the  strength  of  the  delta  function(s)  varies  with  time?  The  first  two  ideas 
are  explored  in  the  rest  of  this  paper. 


4.4.1  Adjusting  the  non-linearity 

We begin with the simplest adjustment. What happens if we increase the non-linearity in the modified-Fisher equation from $-bc^2$ to $-bc^n$? Perhaps this will produce an instability. However, it can be shown that if we change the power of $c$ to any value $n$, the structure of $N$ will be


N  = 


1 

(l  +  /on)- 


(107) 


Direct  instabilities  remain  impossible,  while  this  N  still  does  not  yield  the  negative  slope  N' 
necessary  for  a  Hopf  instability. 

In fact, it appears that we must tailor a function which behaves similarly to the Glass-Mackey type $N$, so that it levels off at some value lower than its maximum. For our system, this seems physically unreasonable.


4.5  Two  delta  functions  and  more... 

Does  spatial  complexity  breed  instability? 

The second physically motivated suggestion is to increase the complexity of the oasis and desert zones to achieve interesting dynamics. If one additional delta function is added to the ring, located at position $a'$ with strength $\beta'$, while the first delta function is located at $a$ with strength $\beta$, an analysis similar to that of section (4) leads to two coupled differential-delay equations:


ft  =  { y  -  <*)/  “  \af2  + 
,ft  x  3  o 

9t  =  (• y  ~  at)g  -  -ag  + 


5  A g(t  -  a) 

1  +  9{t  ~  a) 

;A7(*  ~  «') 

1  +  f{t-  a') 


(108) 

(109) 


where $\lambda = e^{-\alpha a}/2D$ and $\lambda' = e^{-\alpha a'}/2D$. In general, for independent parameters, these equations yield 8 steady-state solutions $(f_0, g_0)$, if we count $f_0, g_0 = 0$. A study of the stability is quite complex. A few things, however, can be said:

1. No direct instabilities exist, for any set of parameters.

2. Equal parameters, $\beta = \beta'$, $a = a'$, result in the amplitudes $f_0$ and $g_0$ always being equal ($f_0 = g_0$). This case reduces to the 1-delta function case, so there is no interesting zoology here.

3. When the parameters are not equal, $f_0$ never equals $g_0$. Stability has not been proven for this case, although numerical tests suggest that the system is stable.

The first two results are also true for a system composed of $n$ delta functions. The coupled delay-equations are straightforward to derive:

ht  =  ft/  -  a/2i  +  , 

f2t  =  &2 f  ~  «/2 2  +  ’ 

fnt  =  hf  ~  af2n  +  ~1-  -  (110) 

1  +  /n-1 

It would be nice to develop a technique to study the stability of all the steady-state solutions for 2 delta functions (and then $n$ delta functions) which is more straightforward than the usual algebraically complex method. Perhaps this will be accomplished as our familiarity with delay-equations grows, just as the simple discovery of $N$ made the analysis of the simplest case swifter and less convoluted.

5  Conclusion 

In conclusion, the modified-Fisher equation and the delocalization transition have been studied in detail for a large ring and in the distinguished limit that $\lambda$ is $O(1)$. While the differential-delay equation was an unexpected result, it is an interesting property of the system which deserves more study in complex inhomogeneous backgrounds. It is also suggested that a time-dependent delta function could model oscillations of illumination due to cloud cover, or the day/night cycle, as well as lead to an interesting problem with a new delay-equation. Other work may also be done in different parameter regimes, especially in the small diameter limit.
Coupled Nonlinear Oscillators


Roberto  Sassi 


1  Introduction 


Mutual synchronization is a common phenomenon in biology. It occurs at different levels, ranging from the small scale of the cardiac pace-maker cells of the SA (Sino-Atrial) and AV (Atrio-Ventricular) nodes in the human heart, which fire synchronously and give the pace to the whole muscle, to the coordinated behaviours of crickets that chirp in unison and of fireflies that flash together in some parts of southeast Asia.

The  dynamics  of  coupled  oscillators  is  a  very  broad  field  of  research;  the  approach  we 
have  chosen  is  only  one  of  the  many  that  are  possible.  The  question  we  would  like  to  answer 
is  something  like:  “What  special  phenomena  can  we  expect  to  arise  from  the  rhythmical 
interaction  of  whole  populations  of  periodic  processes ?”  [1]. 

Winfree [1] was the first to underline the generality of the problem, fixing the first assumptions for a mathematical model. In his work each oscillating species (cell, or cricket, or firefly) is modeled as a nonlinear oscillator with a globally attracting limit cycle; the oscillators were assumed to be weakly coupled and their natural frequencies to be randomly distributed across the population.

Kuramoto [2] proposed the first model (called for this reason the Kuramoto model). His assumptions were that each oscillator is equal to the others, up to the frequency and phase, that the system has a mean field coupling and that the amplitudes of the oscillations are all the same (phase-only model). The equation of the model for the $n$-th oscillator is:

$$\dot{\theta}_n = \omega_n + \frac{K}{N}\sum_{j=1}^{N}\sin(\theta_j - \theta_n) + \xi_n , \qquad (1)$$

where $K$ is the coupling strength, $\omega_n$ is a random variable with probability density function $g(\omega)$ and $\xi_n$ is white noise.

Defining as order parameter the complex number

$$r e^{i\psi} = \frac{1}{N}\sum_{j=1}^{N} e^{i\theta_j} , \qquad (2)$$

it is possible to measure the synchronization among the oscillators' phases: $r = 0$ corresponds to the completely incoherent state, finite $r$ to synchronization.

Kuramoto determined that $r = 0$ is always a steady solution; but there exists, in the case of no added random noise, a critical value of the coupling parameter, $K_c = \frac{2}{\pi g(0)}$, below which only incoherent populations exist ($r = 0$). For $K > K_c$ a population of synchronized oscillators can exist ($r > 0$).
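A minimal simulation sketch of the discrete model may help fix ideas. It is not the scheme used for the figures: it uses an Euler-Maruyama step, a Lorentzian $g(\omega)$ of unit width (so that $K_c = 2$ without noise), and the mean-field identity $\frac{K}{N}\sum_j \sin(\theta_j - \theta_n) = K r \sin(\psi - \theta_n)$. All names, the seed and the parameter values are assumptions made for illustration.

```python
import cmath
import math
import random


def simulate_kuramoto(n, coupling, noise_d, t_end=30.0, dt=0.05, seed=0):
    """Return the history of the order-parameter modulus r(t)."""
    rng = random.Random(seed)
    # Lorentzian-distributed natural frequencies via the inverse CDF
    omegas = [math.tan(math.pi * (rng.random() - 0.5)) for _ in range(n)]
    thetas = [2.0 * math.pi * rng.random() for _ in range(n)]
    kick = math.sqrt(2.0 * noise_d * dt)  # white noise of strength D
    r_history = []
    for _ in range(int(round(t_end / dt))):
        mean_field = sum(cmath.exp(1j * th) for th in thetas) / n
        r, psi = abs(mean_field), cmath.phase(mean_field)
        r_history.append(r)
        thetas = [
            th + dt * (w + coupling * r * math.sin(psi - th))
            + kick * rng.gauss(0.0, 1.0)
            for th, w in zip(thetas, omegas)
        ]
    return r_history


def late_mean(values, fraction=0.25):
    """Average over the last `fraction` of a time series."""
    tail = values[int(len(values) * (1.0 - fraction)):]
    return sum(tail) / len(tail)
```

Well above the critical coupling the late-time average of $r$ should be of order one, while well below it $r$ should stay at the small level set by finite-size fluctuations.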
Figure 1: Numerical simulations on the discrete Kuramoto model with N = 256, D = 0.01, K = 0.65; (upper) time evolution of the probability density function computed on the trajectories of the system by splitting up the $\theta$ axis in sub-intervals; (lower) time evolution of the absolute value of the order parameter.

Figure 2: Numerical simulations on the discrete Kuramoto model with N = 256, D = 0.01, K = 0.8; (upper) time evolution of the probability density function computed on the trajectories of the system by splitting up the $\theta$ axis in sub-intervals; (lower) time evolution of the absolute value of the order parameter.
The results of the numerical simulations, performed by solving equation (1) with N = 256 and two different values of the coupling parameter $K$, are shown in figures (1) and (2); in the upper panels the time evolution of the discrete probability density function (see footnote 1) is plotted; in the lower panels, the time evolution of the order parameter is displayed. The initial condition is, in both cases, a population of oscillators with phases uniformly distributed in $[0, 2\pi]$.

When  K  =  0.8  >  Kc,  in  a  very  short  time,  the  phases  of  the  oscillators  gather  together  in 
a  small  range  of  angles  and  then  begin  drifting  coherently.  The  order  parameter  grows  quickly 
and  exhibits  small  oscillations  due  to  the  random  noise  added  to  the  system  (figure  (2)). 

A different situation arises with $K = 0.65 < K_c$: coherent behaviour never starts, even if small structures can be noticed: small populations of oscillators synchronize and drift for short periods of time. This is reflected in the order parameter, which oscillates between 0 and 0.3 and decreases only slowly (figure (1)).


1.1  A  continuous  model 


Using the approach sketched in the previous paragraph, it is difficult to go much further; it is not easy, for example, to answer questions such as “Is the coherent state ($K > K_c$) stable?”

Strogatz & Mirollo [3] introduced a partial differential equation that describes the behaviour of the Kuramoto model in the limit $N \to \infty$.

The idea is that, in the continuous limit, the state is described by a probability density function $\rho(\theta, \omega, t)$. The Kuramoto equation (1) becomes:

$$v(\theta, \omega, t) = \omega + K\int_{-\infty}^{\infty}\int_{0}^{2\pi}\sin(\phi - \theta)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega , \qquad (3)$$

where $v$ is the velocity at the point $(\theta, \omega, t)$. Moreover, the density function $\rho$ has to satisfy, for each given $\omega$, a normalization law

$$\int_{0}^{2\pi}\rho(\theta, \omega, t)\,d\theta = 1 , \qquad (4)$$

and a Fokker-Planck-type conservation law (see footnote 2)

$$\frac{\partial}{\partial t}\rho(\theta, \omega, t) + \frac{\partial}{\partial\theta}\big(\rho(\theta, \omega, t)\,v(\theta, \omega, t)\big) = D\frac{\partial^2}{\partial\theta^2}\rho(\theta, \omega, t) , \qquad (5)$$

where $\theta \in [0, 2\pi]$ and $\omega \in (-\infty, \infty)$.

The order parameter (2), in the continuous limit, becomes:

$$r e^{i\psi} = \int_{-\infty}^{\infty}\int_{0}^{2\pi} e^{i\theta}\rho(\theta, \omega, t)\,g(\omega)\,d\theta\,d\omega . \qquad (6)$$

1 The $\theta$ axis is divided into 64 intervals and, at each instant of time, the normalized histogram of the phases of the oscillators is computed.

2 The derivation of the two equations has the flavor of the BBGKY hierarchy in plasma physics and can be found in [4]. Some rationalization of equation (5) can be given on recollecting that, because the probability is conserved,

$$\frac{d}{dt}\int_{\theta_1}^{\theta_2}\rho\,d\theta = \int_{\theta_1}^{\theta_2}\rho_t\,d\theta = \rho(\theta_1)v(\theta_1) - \rho(\theta_2)v(\theta_2) ,$$

and on remembering Einstein’s derivation of the diffusion equation in his work on the explanation of Brownian motion [5], which indicates that

$$\rho_t = \frac{\langle\xi^2\rangle}{2}\rho_{\theta\theta} = D\rho_{\theta\theta} .$$

2 Linear Stability Theory

Strogatz & Mirollo [3] worked on the linear stability of the continuous Kuramoto equation. We will try to sketch the main results, useful for the discussion that follows.

With direct substitution into the equations (3) and (5), it can be seen that $\rho_0 = \frac{1}{2\pi}$ is a steady state solution for the system; it corresponds to the incoherent state with $r = 0$.

By linearizing $\rho$ around the steady state solution, that is

$$\rho = \rho_0 + \epsilon\left(c(\omega, t)e^{i\theta} + c^*(\omega, t)e^{-i\theta}\right) + \text{h.h.},$$

where $\epsilon$ is a small parameter, $c^*$ is the complex conjugate of $c$ and h.h. stands for the higher harmonics, then substituting into (3) and introducing the notation

$$G(\theta, t) = \int_{-\infty}^{\infty}\int_{0}^{2\pi}\sin(\phi - \theta)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega ,$$

it can be seen that $G$ is different from zero only for functions that have a component on the bases $e^{i\theta}$ and $e^{-i\theta}$. That is, the higher harmonics do not give any contribution and the linearized equation (5) becomes

$$c_t = -(D + i\omega)c + \frac{K}{2}\int_{-\infty}^{\infty}c(\nu, t)\,g(\nu)\,d\nu . \qquad (7)$$

The discrete spectrum can be computed by seeking solutions of the form $c(\omega, t) = b(\omega)e^{\lambda t}$. By substituting into equation (7), solving for $b(\omega)$, multiplying by $g(\omega)$ and integrating over $\omega$, one finds the dispersion relation,

$$1 = \frac{K}{2}\int_{-\infty}^{\infty}\frac{g(\nu)}{\lambda + D + i\nu}\,d\nu . \qquad (8)$$

When $\lambda$ is negative, the order parameter decays and the system reverts to the incoherent state; vice versa, for $\lambda$ positive, the order parameter grows exponentially and this is, in the coupled oscillators system, the onset of synchronization.

The system also has a continuous spectrum on the line $\mathrm{Re}\,\lambda = -D$ (see [3]). As the dissipation is always positive, the modes in the continuous spectrum are either all decaying or, at most, neutrally stable when $D = 0$.
Figure 3: Continuous spectrum for the dispersion relation (8), both for the case $D \neq 0$ (left) and for the noise-free case with $D = 0$ (right). The discrete spectrum is composed of only one mode that exists for $K > K_c$ and lies to the right of the continuous spectrum (inside the grey region in the picture); in the noise-free case the discrete mode either is unstable or doesn’t exist.


2.1  An  example 

Choosing as probability density function a Lorentzian, that is

$$g(\omega) = \frac{1}{\pi}\,\frac{1}{1 + \omega^2} ,$$

the dispersion relation can be solved analytically; the computed growth rate $\lambda$ is

$$\lambda = \frac{K}{2} - 1 - D , \qquad \lambda + D > 0 ,$$

and the critical coupling (that is, the value of $K$ at which the system is neutrally stable) is

$$K_c = 2(1 + D) .$$

If $D = 0$, then $K_c = 2 = \frac{2}{\pi g(0)}$; this is the same result that Kuramoto found working on the discrete system, as described in the introduction.
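The Lorentzian result can be checked numerically. The sketch below assumes $g(\omega) = 1/[\pi(1+\omega^2)]$ and the dispersion relation $1 = \frac{K}{2}\int g(\nu)/(\lambda + D + i\nu)\,d\nu$; the substitution $\nu = \tan u$ turns $g(\nu)\,d\nu$ into $du/\pi$ on $(-\pi/2, \pi/2)$, so a simple midpoint rule suffices. Function names and sample values are illustrative.

```python
import math


def dispersion_integral(lam, noise_d, n=20000):
    """Integral of g(nu)/(lam + D + i*nu) for the unit Lorentzian g.

    Uses nu = tan(u), for which g(nu) d(nu) = du/pi, and the midpoint rule.
    """
    h = math.pi / n
    total = 0j
    for k in range(n):
        u = -0.5 * math.pi + (k + 0.5) * h
        total += 1.0 / (lam + noise_d + 1j * math.tan(u))
    return total * h / math.pi


def growth_rate(coupling, noise_d):
    """Discrete-mode growth rate, valid while lam + D > 0 (that is, K > 2)."""
    return 0.5 * coupling - 1.0 - noise_d
```

For example, with $K = 3$ and $D = 0.1$ the growth rate is $0.4$, and $\frac{K}{2}$ times the integral evaluated at that $\lambda$ should return one.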


Summarizing, for $K < 2$, the system does not have any discrete mode; for $2 < K < K_c$ the system is stable and exponentially decaying; if $K = K_c$ it has a neutral mode and with $K > K_c$ an unstable growing mode (figure (3(left))).

It’s interesting to notice that, if the dissipation is zero, the system can only be either unstable, with a growing mode ($K > K_c$), or neutrally stable, with no mode ($K < K_c$) (figure (3(right))). But, looking at figure (4(right)), it can be seen that the order parameter is, however, decaying exponentially.

How can we explain this apparent contradiction? Let’s look at the solution of the initial value problem with $K < K_c$ and the initial condition


$$c(\omega, 0) = \frac{6}{\pi}\,\frac{1}{\omega^2 + 4} .$$




Figure  4:  Time  evolution  of  the  order  parameter  for  the  Kuramoto  continuous  model;  compact 
support  within  [— 1;  1]  for  g(u)  =  initial  condition  p0  =  ^  and  pi  =  (left) 

D  =  0.01,  Kc  =  0.739,  £  =  0.001  and  K  =  0.8:  over-critical  coupling,  the  order  parameter  grows 
linearly  and  then,  when  the  nonlinearity  becomes  strong  enough,  saturates;  (right)  D  =  0,  Kc  =  2  —  ^ 
,  £  =  0.1  and  K  =  0.5:  under-critical  coupling,  the  order  parameter  grows,  initially,  and  then  decays. 
Figure 5: (a): real part of the first Fourier component of $\rho(\theta, \omega, t)$ at two different times, obtained with the numerical simulation described in figure (4(right)). (b): real part of the first Fourier component of $\rho(\theta, \omega, t)$ at time $t = 60$, analytically computed. In both cases it is possible to observe the increasing number of oscillations.


Integrating equation (7) via the Laplace transform (for $D = 0$ and $K = 1$), we find that

$$c(\omega, t) = \left\{\frac{6}{\pi}\,\frac{1}{\omega^2 + 4} - \frac{5}{3\pi}\,\frac{1}{2i\omega - 1} - \frac{1}{3\pi}\,\frac{1}{2 - i\omega}\right\}e^{-i\omega t} + \frac{5}{3\pi}\,\frac{1}{2i\omega - 1}\,e^{-t/2} + \frac{1}{3\pi}\,\frac{1}{2 - i\omega}\,e^{-2t} .$$

Evidently, the function $c(\omega, t)$ contains a non-decaying and non-separable term proportional to $e^{-i\omega t}$. As time goes on, this term becomes increasingly crenellated.

What we are seeing here is equivalent to Landau damping in plasma physics; the order parameter is proportional to the integral of the function $c(\omega, t)$; even if the latter doesn't decay, as soon as it starts crenellating, the positive and negative parts cancel and the integral decreases.

In fact, computing the absolute value of the order parameter, we have

$$r = \frac{10}{3}e^{-t/2} - \frac{4}{3}e^{-2t} ,$$

which decays exponentially as $t \to \infty$.
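The cancellation can be illustrated numerically. The sketch below rests on several assumptions that are stated here rather than taken as given: a unit Lorentzian $g(\omega)$, $D = 0$, $K = 1$, the initial condition $c(\omega, 0) = 6/[\pi(\omega^2 + 4)]$, and the corresponding closed-form solution of equation (7). It integrates $c(\omega, t)\,g(\omega)$ over $\omega$ (via $\omega = \tan u$) and compares the result with $\frac{5}{3\pi}e^{-t/2} - \frac{2}{3\pi}e^{-2t}$, which is the order parameter above divided by $2\pi$.

```python
import cmath
import math

P = 5.0 / (3.0 * math.pi)  # weight of the exp(-t/2) term
Q = 1.0 / (3.0 * math.pi)  # weight of the exp(-2t) term


def c_initial(w):
    return 6.0 / (math.pi * (w * w + 4.0))


def c_exact(w, t):
    """Assumed closed-form solution of (7) for D = 0, K = 1."""
    pole_half = 1.0 / (2j * w - 1.0)
    pole_two = 1.0 / (2.0 - 1j * w)
    free = (c_initial(w) - P * pole_half - Q * pole_two) * cmath.exp(-1j * w * t)
    return free + P * pole_half * math.exp(-0.5 * t) + Q * pole_two * math.exp(-2.0 * t)


def mean_field(t, n=100000):
    """Integral of c(w, t) g(w) dw, using w = tan(u) so that g dw = du/pi."""
    h = math.pi / n
    total = 0j
    for k in range(n):
        u = -0.5 * math.pi + (k + 0.5) * h
        total += c_exact(math.tan(u), t)
    return total * h / math.pi


def mean_field_closed_form(t):
    return P * math.exp(-0.5 * t) - 2.0 * Q * math.exp(-2.0 * t)
```

At a fixed frequency the modulus of `c_exact` stays of order one for all times, while the frequency integral decays: this is the phase mixing described in the text.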


3  Numerical  Integration 

The integration of the discrete model (equation (1)) has been performed with a fixed step ($\Delta t = 0.1$), fully-implicit predictor-corrector scheme. The fixed time step is forced by the noise added to the derivatives, which prevents the convergence of most adaptive methods.

In the continuous model (equations (5) and (3)) the density function $\rho$ is periodic in $\theta$ with period $2\pi$, so it can be expanded in the Fourier series:

$$\rho(\theta, \omega, t) = \sum_{m=-\infty}^{\infty}\rho_m(\omega, t)\,e^{im\theta} .$$


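Substituting into the equations (5) and (3), we obtain the system of nonlinear ordinary differential equations for the Fourier coefficients $\rho_m$:

$$\rho_{0t} = 0 ,$$

$$\rho_{mt} = -K\pi m\,\langle\rho_{-1}\rangle\,\rho_{m+1} - (im\omega + Dm^2)\,\rho_m + K\pi m\,\langle\rho_{1}\rangle\,\rho_{m-1} \quad \text{for } m \neq 0 ,$$

where

$$\langle f\rangle = \int_{-\infty}^{\infty} f(\omega)\,g(\omega)\,d\omega . \qquad (9)$$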

For each value of $\omega$, truncating the Fourier series at $m = L$, the system can be efficiently integrated (a semi-implicit Adams-Bashforth-Moulton predictor-corrector scheme leads to the inversion of a tri-diagonal matrix). We used $L = 16, 32, 64, 128$ in the computations; the smaller the dissipation or the longer the time windows considered, the bigger the number of Fourier components necessary to approximate the Kuramoto system properly. We found $L = 32$ a good compromise in many situations.

A little more attention is needed for the evaluation of the integral (9). We found that the most efficient way of computing it is via Gauss-Legendre quadrature formulae, setting compact support for $g(\omega)$, and using the solution for $\rho$ produced at the previous available time step.
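The structure of the truncated system can be sketched in a few lines. This is not the semi-implicit Adams-Bashforth-Moulton scheme described above: for brevity it uses an explicit fourth-order Runge-Kutta step, stores only the modes $m = 0, \ldots, L$ (using $\langle\rho_{-1}\rangle = \langle\rho_1\rangle^*$ for real $\rho$ and $g$), and takes the products of quadrature weights and $g(\omega_j)$ as a user-supplied list. All names are illustrative assumptions.

```python
import math


def frequency_average(values, weights):
    """<f> as a quadrature sum; weights[j] = quadrature weight * g(omega_j)."""
    return sum(w * v for w, v in zip(weights, values))


def spectral_rhs(modes, omega, mean_rho1, coupling, noise_d):
    """Time derivatives of rho_m, m = 0..L, at one frequency omega."""
    top = len(modes) - 1
    mean_rho_minus1 = mean_rho1.conjugate()
    rates = [0j]  # rho_0 is conserved
    for m in range(1, top + 1):
        upper = modes[m + 1] if m < top else 0j  # truncation: rho_{L+1} = 0
        rates.append(
            -coupling * math.pi * m * mean_rho_minus1 * upper
            - (1j * m * omega + noise_d * m * m) * modes[m]
            + coupling * math.pi * m * mean_rho1 * modes[m - 1]
        )
    return rates


def rk4_step(state, omegas, weights, coupling, noise_d, dt):
    """One explicit RK4 step; state[j][m] is rho_m at frequency omegas[j]."""
    def rhs(s):
        mean_rho1 = frequency_average([modes[1] for modes in s], weights)
        return [spectral_rhs(modes, w, mean_rho1, coupling, noise_d)
                for modes, w in zip(s, omegas)]

    def shifted(s, k, scale):
        return [[a + scale * b for a, b in zip(ms, ks)] for ms, ks in zip(s, k)]

    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, 0.5 * dt))
    k3 = rhs(shifted(state, k2, 0.5 * dt))
    k4 = rhs(shifted(state, k3, dt))
    return [[a + dt * (b + 2.0 * c + 2.0 * d + e) / 6.0
             for a, b, c, d, e in zip(ms, r1, r2, r3, r4)]
            for ms, r1, r2, r3, r4 in zip(state, k1, k2, k3, k4)]
```

A simple sanity check is the uncoupled case $K = 0$, where each mode must decay as $|\rho_m| \propto e^{-Dm^2 t}$ while $\rho_0$ stays fixed.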

In figures (6) and (7) the solutions computed for $\rho(\theta, \omega, t)$ are shown; in the sub-critical case, when the coupling parameter $K$ is smaller than $K_c$, stripes of probability can be noticed, which increase in number and, slightly tilting, start shrinking. At fixed $\theta$, this is the same crenellation as described above and seen in figure (5).

When $K > K_c$, in the super-critical case, the probability gathers, initially, in a stripe-like area, but immediately also starts to deplete from the central region. Unlike before, the number of stripes doesn’t increase (in this case there’s no Landau damping); two areas collect the whole probability. The process is reminiscent of the formation of a shock layer in the white regions in figure (7), but a truly weak solution does not form due to the dissipation introduced by the noise.


4  A  symmetry  property 

Looking at the numerical results of the previous section, it can be observed that, starting from $\rho(\theta, \omega, 0)$ and $g(\omega)$ which are even functions in $\omega$, a symmetry is preserved during the evolution of the dynamics; that is, $\rho(\theta, \omega, t) = \rho(-\theta, -\omega, t)$. This behaviour can be explained in a general way.
Figure 6: $\rho(\theta, \omega, t)$ at six successive instants of time obtained via numerical integration of the Kuramoto continuous model with D = 0 and K = 0.5 (see figure (4)(right) for further details). The coupling K is sub-critical.

Figure 7: $\rho(\theta, \omega, t)$ at six successive instants of time obtained via numerical integration of the Kuramoto continuous model with D = 0.01 and K = 0.8 (see figure (4)(left) for further details). The coupling K is super-critical.
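Let’s start by stating that the probability density function is such that, for each integer $n$,

$$\rho(\theta, \omega, t) = \rho(\theta + 2\pi n, \omega, t) = \rho(2\pi n - \theta, -\omega, t) = \rho(-\theta, -\omega, t) . \qquad (10)$$

The symmetry property will be proved by construction.

Let’s expand $\sin(\phi - \theta)$ in equation (3); the first integral can be rewritten

$$\int_{-\infty}^{\infty}\int_{-\pi}^{\pi}\sin(\phi)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega = \int_{-\infty}^{\infty}\int_{-\pi}^{0}\sin(\phi)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega + \int_{-\infty}^{\infty}\int_{0}^{\pi}\sin(\phi)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega . \qquad (11)$$

If we make the change of variables $\phi' = -\phi$ and $\omega' = -\omega$ in the first term on the right and we use equation (10), together with the evenness of $g$, expression (11) becomes

$$-\int_{-\infty}^{\infty}\int_{0}^{\pi}\sin(\phi')\,\rho(\phi', \omega', t)\,g(\omega')\,d\phi'\,d\omega' + \int_{-\infty}^{\infty}\int_{0}^{\pi}\sin(\phi)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega = 0 .$$

At the end,

$$\int_{-\infty}^{\infty}\int_{-\pi}^{\pi}\sin(\phi - \theta)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega = -\sin(\theta)\int_{-\infty}^{\infty}\int_{-\pi}^{\pi}\cos(\phi)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega = -Q(t)\sin(\theta) . \qquad (12)$$

Substituting (12) into equation (5), it can be noticed that $\rho(-\theta, -\omega, t)$ is a solution of the Kuramoto model.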


5  Weakly  Nonlinear  Theory 

When $K = K_c$, the system is, by definition, linearly neutrally stable. In this situation and for $D \neq 0$, Bonilla et al. [6] and Crawford [7] developed an asymptotic expansion around the equilibrium solution and derived a Landau-type ordinary differential equation for the amplitude of the perturbation.

Defining $\epsilon \ll 1$, they chose the scalings

$$\epsilon^2 t = T , \qquad \partial_t = \epsilon^2\,\partial_T ,$$

where $T$ is the slow time ($T$ is of order one only when $t$ is of order $\epsilon^{-2}$),

$$K = K_c + \epsilon^2 K_2 + \cdots$$
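and expanded the function $\rho$ such that

$$\rho(\theta, \omega, T) = \frac{1}{2\pi} + \epsilon\left(\rho_1(\omega)a_1(T)e^{i\theta} + \rho_1^*(\omega)a_1^*(T)e^{-i\theta}\right) + \epsilon^2\left(\rho_2(\omega)a_2(T)e^{i\theta} + \rho_2^*(\omega)a_2^*(T)e^{-i\theta}\right) + \epsilon^3(\cdots) + O(\epsilon^4) .$$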


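From equations (5) and (3) it follows that

$$\frac{\partial\rho}{\partial t} = -\omega\frac{\partial\rho}{\partial\theta} - K\frac{\partial}{\partial\theta}\left[\rho\int_{-\infty}^{\infty}\int_{0}^{2\pi}\sin(\phi - \theta)\,\rho(\phi, \omega, t)\,g(\omega)\,d\phi\,d\omega\right] + D\frac{\partial^2\rho}{\partial\theta^2}$$

and

$$G = \epsilon G_1 + \epsilon^2 G_2$$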


At  the  first  order  (0(e))  : 


Pi  = 


K  f -oa  Pi{v)9(y)&/ 


2  (cj  —  %D ) 

Continuing  to  0(e3),  eventually,  one  derives  the  amplitude  equation, 


K2 


air  =  -yai  +  (Kcn)  X\aifai 


where 


X  =  /  Pi9(u)  ^  =  constant 
u)  -  2 %D 


-s: 


We can see from the expression for $\rho_1$ that this kind of approach is not satisfactory when $D = 0$: $\rho_1$ diverges at $\omega = 0$, but since it is a probability density, this cannot have any physical meaning and has to be avoided.

5.1 Weakly Nonlinear Theory: $D \neq 0$, but small

In this section we try to sketch how it’s possible to develop an asymptotic expansion for the case $1 \gg D \neq 0$. Let’s choose the variables to scale as in the previous case (Hopf scaling) but let’s say that $D = \epsilon^2 D_2$. The motivation for this scaling can be found in the numerical experiments we performed and in the analysis by Daido and Crawford [4].

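Substituting, as in the previous paragraph, into the equations (5) and (3) we have

$$\epsilon^2\rho_T + \omega\rho_\theta + \left(\frac{G_\theta}{2\pi} + (\rho G)_\theta\right)(K_c + \epsilon^2 K_2) = \epsilon^2 D_2\,\rho_{\theta\theta}$$

and at the first and second order we can derive the following expressions for $\rho_1$ and $\rho_2$:

$$O(\epsilon): \quad \omega\rho_{1\theta} + \frac{G_{1\theta}K_c}{2\pi} = 0 , \qquad \rho_1 = -\frac{K_c G_1}{2\pi\omega} ,$$

$$O(\epsilon^2): \quad \omega\rho_{2\theta} + \frac{G_{2\theta}K_c}{2\pi} = -K_c(\rho_1 G_1)_\theta , \qquad \rho_2 = -\frac{K_c G_2}{2\pi\omega} + \frac{K_c^2 G_1^2}{2\pi\omega^2} .$$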


Here is the problem we have to overcome: to treat properly the case $D \ll 1$ we need a critical layer around $\omega = 0$; in fact, at this point, the asymptotic expansion breaks down and $\rho_2$ is bigger than $\rho_1$.

Inside the critical layer, we set the new independent and dependent variables

$$\omega = \epsilon y , \qquad \rho = Z(\theta, y, T) = Z_0 + \epsilon Z_1 .$$

The equation in the inner region becomes

$$\epsilon^2 Z_T + \epsilon y Z_\theta + \left(\frac{G_\theta}{2\pi} + (ZG)_\theta\right)(K_c + \epsilon^2 K_2) = \epsilon^2 D_2 Z_{\theta\theta}$$

and at the first two orders

$$O(\epsilon): \quad yZ_{0\theta} + \frac{G_{1\theta}K_c}{2\pi} + K_c(Z_0 G_1)_\theta = 0 , \qquad Z_0 = -\frac{1}{2\pi}\,\frac{K_c G_1}{y + K_c G_1} ,$$

$$O(\epsilon^2): \quad yZ_{1\theta} + \frac{G_{2\theta}K_c}{2\pi} + (Z_1 G_1)_\theta K_c = -Z_{0T} - (Z_0 G_2)_\theta K_c + D_2 Z_{0\theta\theta} , \qquad \left((y + K_c G_1)Z_1 + \frac{K_c G_2}{2\pi}\right)_\theta = \Gamma , \qquad (13)$$

where $\Gamma$ is a function, involving inverse powers of $(y + K_c G_1)$, that for our purpose we need not derive.

Even in the inner region we still have trouble: when $y + K_c G_1 = 0$, $Z_0$ diverges and $Z_1$ diverges even more; the asymptotic expansion breaks down another time.

This is very peculiar; the first inner layer is not sufficient at all and another critical layer, a second one, is necessary inside the first. Moreover the shape of this inner layer is peculiar: it develops around the curve $y + K_c G_1 = 0$. The whole situation is sketched in figure (8).

In this second inner layer we define the new independent variable

$$y + K_c G_1 = \epsilon^\alpha\xi$$
Figure 8: Weakly Nonlinear Theory. (A): When the dissipation is small ($O(\epsilon^2)$) a critical layer becomes necessary around the frequency $\omega = 0$ (light gray region); the width of this layer is of order $\epsilon$. (B): The sketched box in (A) is enlarged: inside it is possible to notice the second “snake-like” critical layer (inner-inner layer, dark gray region), that is necessary around the line $y + K_c G_1 = 0$. (C): The sketched box in (B) is enlarged again: the width of the inner-inner layer is of order $\epsilon^\alpha$ ($\alpha = 1$ if $D_2 = 0$; $\alpha = 1/2$ if $D_2 \neq 0$).
and the new dependent variable


z  =  nej±J^hT). 

Deriving a new equation in this second inner layer is very technical and we will not develop it. The reason lies in the strange relation between $T$, $\theta$ and $\xi$, imposed by the shape of the layer. On the change of variable to $\xi$, the $\theta$ and $T$ derivatives pick up additional $\xi$-derivatives,

$$[Z_\theta]_{y,T} = [Z_\theta]_{\xi,T} + \frac{K_c G_{1\theta}}{\epsilon^\alpha}[Z_\xi]_{\theta,T} , \qquad [Z_T]_{\theta,y} = [Z_T]_{\theta,\xi} + \frac{K_c G_{1T}}{\epsilon^\alpha}[Z_\xi]_{\theta,T} ,$$

and this makes it much more complex to derive useful analytical expressions.

We conclude this section by sketching an argument for determining the scaling parameter $a$. We can deduce from the relations (13) the scaling for $Z_0$ and $Z_1$,

$$O(\epsilon):\quad Z_0 \sim O(\epsilon^{-a}),$$

$$O(\epsilon^2):\quad Z_1 \sim O(\epsilon^{-2a}\ \text{or}\ D_2\,\epsilon^{-3a}).$$

Hence $Z_1$ scales in a different way depending on the value of $D_2$.

We now apply the condition that $Z_0$ and $\epsilon Z_1$ have to scale at the same order, which is where the expansion formally breaks down and we enter the innermost region. There are two different situations, depending upon the value of $D_2$: if $D_2 = 0$, then $a = 1$, and when $D_2 \neq 0$, then $a = 1/2$. The reason for the scaling we chose for $D$ now becomes clearer: we would have lost this double behaviour with a higher scaling of the noise term ($D \to \epsilon^p D$ with $p > 2$).
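The balance behind these two values of $a$ can be checked in a few lines. The sketch below is illustrative only: the helper name `balance_exponent` and the integer `n` (the power of $\epsilon^{-a}$ carried by $Z_1$) are assumptions introduced here, not notation from the report.

```python
from fractions import Fraction


def balance_exponent(n):
    """Return a such that eps**(-a) balances eps * eps**(-n*a).

    Z_0 ~ eps**(-a) and eps*Z_1 ~ eps**(1 - n*a); equating the exponents,
    -a = 1 - n*a, gives a = 1/(n - 1).  Here n = 2 when D_2 = 0 and
    n = 3 when D_2 != 0 (hypothetical labels for the two cases).
    """
    if n <= 1:
        raise ValueError("no balance is possible for n <= 1")
    return Fraction(1, n - 1)


a_without_noise = balance_exponent(2)  # D_2 = 0
a_with_noise = balance_exponent(3)     # D_2 != 0
print(a_without_noise, a_with_noise)
```

Exact fractions are used so that the two exponents come out as rationals rather than floating-point approximations.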

Summarizing the results of this section: when $D$ is finite but small, the asymptotic expansion reveals that we need at least two critical layers, one inside the other. The first, around $\omega = 0$, has a width of order $\epsilon$; the second, snake-shaped, lies inside the first and has a characteristic width that depends upon the value of $D_2$.

6  Steadily  propagating  solutions 

In this section we look for steadily propagating solutions for $\rho$, that is

$$\rho(\theta, \omega, t) = \rho(\theta - \Omega t, \omega),$$

where $\Omega$ is the propagation velocity.

Substituting this expression into equation (3) and making the change of variable $\Phi = \phi - \Omega t$, we find

$$G(\theta, t) = \int_{-\infty}^{\infty}\int_0^{2\pi} \sin(\phi - \theta)\,\rho(\phi - \Omega t, \omega)\,g(\omega)\,d\phi\,d\omega$$

$$= \int_{-\infty}^{\infty}\int_0^{2\pi} \sin(\Phi - (\theta - \Omega t))\,\rho(\Phi, \omega)\,g(\omega)\,d\Phi\,d\omega$$

$$= G(\theta - \Omega t).$$

From this, we can observe that $G$ preserves the dependence on $(\theta - \Omega t)$ that $\rho$ is supposed to have.

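Equation (5), in the situation $D = 0$, becomes

$$(\omega - \Omega)\rho_\theta + K(G\rho)_\theta = 0; \qquad (13)$$

integrating over $\theta$,

$$\rho = \frac{J(\omega)}{\omega - \Omega + K G(\theta - \Omega t)}, \qquad (14)$$

where $J(\omega)$ is a generic function of $\omega$.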

As N.J.B. says in these situations, we are in bad shape: since $\rho$ is a probability distribution function, it cannot have a singularity; yet since $\omega$ is a real variable that spans the entire real axis, the pole from the denominator of $\rho$ is hard to avoid.

This is an interesting point: if the dissipation is zero, the system does not admit steadily propagating solutions. But what happens when the dissipation is small?

If $D \neq 0$, equation (13) has an additional term, and after the integration over $\theta$ we have

$$(\omega - \Omega)\rho + K G \rho = J(\omega) + D\rho_\theta. \qquad (15)$$

For the sake of simplicity, let us fix $\omega$ at a certain value; then there is a value of $\Theta = \theta - \Omega t$ at which the denominator of equation (14) vanishes; we call this value $\Delta$.

As $D \to 0$, $\rho$ scales as $1/(\Theta - \Delta)$; hence we need an inner layer in the proximity of $\Theta - \Delta = 0$ in which the dissipative term becomes important. For this reason we choose the following scaling and variables,


0- A 


P 

D 


eS 


Substituting  in  equation  (15)  and  noting  that  d$  =  9©  =  e  2dg  we  have 

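$$K G_\Theta(\Delta)\,\delta R = J(\omega) + R_\delta.$$

Integrating over $\delta$, we can find the expression of the probability distribution function in the inner layer and check the condition for the matching with the outer:

$$R_\delta - K G_\Theta(\Delta)\,\delta R = -J(\omega),$$

$$\left(R\,e^{-K G_\Theta(\Delta)\delta^2/2}\right)_\delta = -J(\omega)\,e^{-K G_\Theta(\Delta)\delta^2/2},$$

$$R = R_0\,e^{K G_\Theta(\Delta)\delta^2/2} - J(\omega)\,e^{K G_\Theta(\Delta)\delta^2/2}\int^{\delta} e^{-K G_\Theta(\Delta)\delta'^2/2}\,d\delta'.$$

The value of $R_0$ can be computed with the normalization condition (4).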
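As a quick sanity check on the inner-layer solution, the sketch below takes the case $J(\omega) = 0$ with $c = K G_\Theta(\Delta) < 0$, so that $R = R_0 e^{c\delta^2/2}$ is a Gaussian. It verifies numerically that this profile satisfies $c\,\delta R = R_\delta$ and that its integral across the layer is $R_0\sqrt{2\pi/|c|}$. The function names and the sample values of `r0` and `c` are assumptions chosen for illustration.

```python
import math


def inner_profile(delta, r0, c):
    """Inner-layer density R(delta) = R_0 exp(c delta^2 / 2) for J = 0."""
    return r0 * math.exp(0.5 * c * delta * delta)


def ode_residual(delta, r0, c, h=1e-5):
    """Residual of c*delta*R - dR/ddelta, using a central difference."""
    slope = (inner_profile(delta + h, r0, c)
             - inner_profile(delta - h, r0, c)) / (2.0 * h)
    return c * delta * inner_profile(delta, r0, c) - slope


def layer_mass(r0, c, half_width=12.0, n=20001):
    """Trapezoidal integral of R over [-half_width, half_width]."""
    step = 2.0 * half_width / (n - 1)
    total = 0.0
    for i in range(n):
        weight = 0.5 if i in (0, n - 1) else 1.0
        total += weight * inner_profile(-half_width + i * step, r0, c)
    return total * step


print(ode_residual(0.7, 1.0, -2.0))
print(layer_mass(1.0, -2.0), math.sqrt(2.0 * math.pi / 2.0))
```

For $c > 0$ the same profile grows without bound as $|\delta|$ increases, which is why a bounded solution then requires $R_0 = 0$.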
Figure 9: Steadily propagating solution with $\Omega = 0$: a sketch of $\rho(\theta, \omega)$. Near the phase $\theta$ (where $\omega + K G(\theta) = 0$ and $G_\theta(\theta) < 0$) a critical layer (light gray region) is necessary. In the outer layer $\rho$ is zero everywhere.

This solution is potentially dangerous: it has to be bounded, otherwise it again diverges; moreover, in the limit of $\delta$ going to infinity (that is, going out of the inner layer) $R$ still has to remain bounded, for a proper matching with $\rho$. Using these two conditions, we fix the values of $J(\omega)$ and $R_0$:

$$J(\omega) = 0 \quad \text{for } G_\Theta(\Delta) < 0,$$

$$J(\omega) = 0,\ R_0 = 0 \quad \text{for } G_\Theta(\Delta) > 0. \qquad (16)$$

The matching with the outer layer is straightforward: as $\delta$ goes to infinity, $R$ tends to zero and $\rho$ is zero everywhere. Figure (9) shows a sketch of the situation for $\Omega = 0$.

Summarizing this result, steadily propagating solutions cannot develop for $D = 0$; with $D$ finite but small, they exist only in a thin layer that follows the line $\omega - \Omega + K G(\theta - \Omega t) = 0$ where $G_\Theta(\Delta) < 0$.

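The probability density $\rho$ is exponentially small everywhere in the outer layer; for this reason the integral for $G$ is limited to the inner layer, and

$$G(\theta, t) = \int_{-\infty}^{\infty}\int_0^{2\pi} \sin(\Phi - (\theta - \Omega t))\,\rho(\Phi, \omega)\,g(\omega)\,d\Phi\,d\omega$$

$$\simeq \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \sin(\Delta(\omega) - (\theta - \Omega t))\,R_0\,e^{K G_\Theta(\Delta)\delta^2/2}\,g(\omega)\,d\delta\,d\omega.$$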


6.1  An example: $\Omega = 0$

Let us say that $\Omega = 0$, so that we look for a steady solution of the kind $\rho(\theta, \omega)$. In this case, steadily propagating solutions can develop only along the line $\omega + K G(\theta) = 0$.

Choosing a symmetric initial condition and applying relation (12), the curve becomes

$$\omega = K Q(t)\sin(\theta).$$

Moreover, along the line, solutions can form only where $G_\theta < 0$, that is

$$-Q(t)\frac{d\sin(\theta)}{d\theta} < 0 \quad\Longrightarrow\quad \frac{d\sin(\theta)}{d\theta} > 0,$$

as $K$ and $Q$ are both positive. In figure (10) this prediction is compared to the actual solution obtained with the numerical simulations. The agreement is evident, and this explains the depletion of probability noticed in the middle of the $\theta$-$\omega$ plane in figure (7).


7  The case $g(\omega) = \delta(\omega)$

In the previous two sections we saw that, if we set $D$ small but different from zero, it is possible to perform an asymptotic expansion and steadily propagating solutions can develop. But what happens when $D = 0$? The approach described so far only underlined that this is a critical situation. In this section, with $D = 0$, we show how it is possible to solve the Kuramoto equations analytically in the case that $g(\omega)$ is a Dirac delta function.

In this case all the oscillators share the same frequency $\omega = 0$ and have different phases. Starting from a symmetric initial condition, that is $\rho(\theta, 0) = \rho(-\theta, 0)$, and using property (12), equations (5) and (3) become

$$\rho_t(\theta, 0, t) - K\partial_\theta\left(\rho(\theta, 0, t)\sin(\theta)\int_0^{2\pi}\int_{-\infty}^{\infty}\cos(\phi)\,\rho(\phi, 0, t)\,\delta(\omega)\,d\omega\,d\phi\right) = 0,$$

$$\rho_t(\theta, t) - K\partial_\theta\left(\rho(\theta, t)\sin(\theta)\int_0^{2\pi}\cos(\phi)\,\rho(\phi, t)\,d\phi\right) = 0.$$

Setting

$$\mathcal{F}(t) = \int_0^{2\pi}\cos(\phi)\,\rho(\phi, t)\,d\phi,$$

the last relation becomes

$$\rho_t - K\sin(\theta)\,\mathcal{F}(t)\,\rho_\theta = K\cos(\theta)\,\mathcal{F}(t)\,\rho. \qquad (17)$$
We solve this equation by the method of characteristics; for equation (17) the characteristic equations are

$$\frac{dt}{1} = \frac{d\theta}{-K\sin(\theta)\,\mathcal{F}(t)} = \frac{d\rho}{K\cos(\theta)\,\mathcal{F}(t)\,\rho}.$$

Hence,

$$\frac{d\theta}{dt} = -K\sin(\theta)\,\mathcal{F}(t), \qquad \frac{d\rho}{dt} = K\cos(\theta)\,\mathcal{F}(t)\,\rho.$$

Figure 10: $\rho(\theta, \omega, t)$ at $t = 180$ obtained via numerical integration of the Kuramoto continuous model with $D = 0.01$ and $K = 0.8$ (see figure (7) (left) for further details). UPPER: contour plot of $\rho$; the superimposed sketched line is $\omega = K Q(t)\sin(\theta)$. The steadily propagating solution (with $\Omega = 0$) can develop only where $G'(\theta) < 0$, as expected. LOWER: plot of $\rho$. Looking back at figure (7), it is now possible to understand why the probability is zero in the middle of the pictures.

Solving the first, we obtain

$$\tan(\theta_0/2) = \xi(t)\tan(\theta/2); \qquad (18)$$

dividing the second by the first,

$$\frac{d\rho}{d\theta} = -\rho\cot(\theta), \qquad \rho\sin(\theta) = \text{constant} = \rho_0(\theta_0(\theta, t))\sin(\theta_0(\theta, t)).$$

The derivation here becomes a little technical:

$$\rho = \rho_0\,\frac{\dfrac{2\tan(\theta_0/2)}{1 + \tan^2(\theta_0/2)}}{2\sin(\theta/2)\cos(\theta/2)} = \rho_0\,\frac{\dfrac{2\tan(\theta/2)\,\xi(t)}{1 + \tan^2(\theta/2)\,\xi^2(t)}}{2\sin(\theta/2)\cos(\theta/2)} = \rho_0\,\frac{e^{-Kq}}{e^{-2Kq}\cos^2(\theta/2) + \sin^2(\theta/2)},$$

where $q = \int_0^t \mathcal{F}(t')\,dt'$ and $\xi = e^{Kq}$.
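The closed form obtained from the characteristics can be checked against a direct numerical integration. The sketch below assumes, purely for illustration, that $\mathcal{F}$ is held constant (so that $q = \mathcal{F}t$), and integrates $d\theta/dt = -K\sin(\theta)\mathcal{F}$ and $d\rho/dt = K\cos(\theta)\mathcal{F}\rho$ with a classical Runge-Kutta step; the function names and sample parameter values are hypothetical.

```python
import math


def rk4_characteristic(theta0, rho0, K, F, t_end, n_steps=2000):
    """Integrate the two characteristic ODEs with constant F (an assumption)."""
    def rhs(theta, rho):
        return -K * F * math.sin(theta), K * F * math.cos(theta) * rho

    dt = t_end / n_steps
    theta, rho = theta0, rho0
    for _ in range(n_steps):
        k1 = rhs(theta, rho)
        k2 = rhs(theta + 0.5 * dt * k1[0], rho + 0.5 * dt * k1[1])
        k3 = rhs(theta + 0.5 * dt * k2[0], rho + 0.5 * dt * k2[1])
        k4 = rhs(theta + dt * k3[0], rho + dt * k3[1])
        theta += dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        rho += dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
    return theta, rho


def closed_form(theta0, rho0, K, q):
    """tan(theta/2) = tan(theta0/2) e^{-Kq} and the density formula for rho."""
    e = math.exp(-K * q)
    theta = 2.0 * math.atan(math.tan(0.5 * theta0) * e)
    denom = e * e * math.cos(0.5 * theta) ** 2 + math.sin(0.5 * theta) ** 2
    return theta, rho0 * e / denom


theta_num, rho_num = rk4_characteristic(1.0, 0.3, K=0.8, F=0.5, t_end=3.0)
theta_exact, rho_exact = closed_form(1.0, 0.3, K=0.8, q=0.5 * 3.0)
print(theta_num - theta_exact, rho_num - rho_exact)
```

The product $\rho\sin(\theta)$ stays equal to its initial value along the characteristic, which is the conservation statement used in the derivation.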

As time goes on, we expect the order parameter to saturate to a certain constant value; this is what we saw in the numerical simulations. The order parameter is, by definition, proportional to $G$ by a factor $2\pi$. Calling $\Gamma$ the saturation value of $\mathcal{F}$, then $q$, the integral of $\mathcal{F}$, becomes

$$q \sim \Gamma t \quad \text{as } t \to \infty.$$


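Choosing now the initial condition $\rho_0 = \cos(\theta_0)$, and recalling relation (18), we find

$$\rho_0 = \frac{1 - \tan^2(\theta_0/2)}{1 + \tan^2(\theta_0/2)} = \frac{1 - \tan^2(\theta/2)\,\xi^2}{1 + \tan^2(\theta/2)\,\xi^2} = \frac{e^{-2Kq}\cos^2(\theta/2) - \sin^2(\theta/2)}{e^{-2Kq}\cos^2(\theta/2) + \sin^2(\theta/2)}.$$

Finally, substituting $\rho_0$ into the expressions for $\rho$ and $q$ we have

$$\rho = e^{-Kq}\,\frac{e^{-2Kq}\cos^2(\theta/2) - \sin^2(\theta/2)}{\left(e^{-2Kq}\cos^2(\theta/2) + \sin^2(\theta/2)\right)^2}$$

and

$$\frac{dq}{dt} = e^{-Kq}\int_0^{2\pi}\cos(\theta)\,\frac{e^{-2Kq}\cos^2(\theta/2) - \sin^2(\theta/2)}{\left(e^{-2Kq}\cos^2(\theta/2) + \sin^2(\theta/2)\right)^2}\,d\theta. \qquad (19)$$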


The integral in (19) can be evaluated in the limit that $q$ goes to infinity. Making the change of variable $\theta = 2x\,e^{-Kq}$ and expanding the trigonometric functions in Taylor series, it becomes:


dq 

dt 

Q 


- 2x 2 

(1  +  x2)2 


dx  =  — 27r 


— 27Tt 
Summarizing, as time goes on, $q$ grows linearly; $\rho$ becomes small everywhere except near $\theta = 0$ where $\rho \sim e^{Kq}$, so that it grows exponentially. The dynamics brings probability toward a singular phase (the point $\theta = 0$ here); this is exactly what we saw in the numerical simulation, although with a different $g(\omega)$, and along a curve on the $(\theta, \omega)$ plane.

Concluding, the solution is a spike-shaped, shock-like object. This is also consistent with another fact, namely that, with $g(\omega) = \delta(\omega)$ and the alternative coupling $\sin(\phi - \theta) \to \delta(\phi - \theta)$, the Kuramoto model reduces to Burgers' equation,

$$\rho_t(\theta, 0, t) + \left[K\rho^2(\theta, 0, t)\right]_\theta = 0.$$

8  Discrete  vs.  Continuous 

An important question is whether the continuous approximation, made in the limit $N \to \infty$, remains valid with a relatively small number of oscillators; that is, whether the results obtained in the previous sections are applicable to equation (1).

We do not have a final answer to such a question but, comparing figures (11) and (12), obtained from the discrete model with $N = 256$, with figures (6) and (7), it can be seen that the main behaviours are the same. In particular, in figure (12), we see the probability gathering in two symmetric areas of the plane, as in the continuous case of figure (7).

A peculiarity of the discrete case is that the concentration of probability drifts in the $\theta$ direction; we suppose this to be related to the fact that, although the probability density function from which the initial condition is drawn is symmetric, the actual initial condition is not.
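For readers who want to experiment with the discrete model, the following is a minimal sketch under stated assumptions: it takes the standard noisy Kuramoto form $d\theta_i/dt = \omega_i + (K/N)\sum_j \sin(\theta_j - \theta_i) + \sqrt{2D}\,\eta_i(t)$, which may differ in detail from equation (1), and advances it with an Euler-Maruyama step. The function names, step size, and sample parameters are illustrative and are not those used for figures (11) and (12).

```python
import cmath
import math
import random


def order_parameter(thetas):
    """Return (r, psi) with r e^{i psi} = (1/N) sum_j e^{i theta_j}."""
    z = sum(cmath.exp(1j * t) for t in thetas) / len(thetas)
    return abs(z), cmath.phase(z)


def simulate(thetas, omegas, K, D, dt, n_steps, rng):
    """Euler-Maruyama integration of the (assumed) noisy Kuramoto model.

    Uses the mean-field identity
    (K/N) sum_j sin(theta_j - theta_i) = K r sin(psi - theta_i).
    """
    thetas = list(thetas)
    noise_amp = math.sqrt(2.0 * D * dt)
    for _ in range(n_steps):
        r, psi = order_parameter(thetas)
        thetas = [
            th + dt * (om + K * r * math.sin(psi - th))
            + noise_amp * rng.gauss(0.0, 1.0)
            for th, om in zip(thetas, omegas)
        ]
    return thetas


rng = random.Random(0)
n_osc = 64
initial = [rng.uniform(-1.0, 1.0) for _ in range(n_osc)]
identical = [0.0] * n_osc  # all natural frequencies equal, no noise below
final = simulate(initial, identical, K=1.0, D=0.0, dt=0.01, n_steps=3000, rng=rng)
print(order_parameter(initial)[0], order_parameter(final)[0])
```

With identical frequencies and no noise the phases collapse onto one another and the order parameter approaches one; drawing `omegas` from a distribution $g(\omega)$ and setting `D > 0` gives the setting discussed in the text.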

9  Conclusions  and  other  remarks 

The Kuramoto model generates many interesting results, many more than we were expecting. So, in this report, for reasons of brevity, we have omitted several of the analyses we made. We mention two particular ones here.

First, we studied the issue of transient amplification: starting from a situation of equilibrium with a sub-critical coupling, and imposing perturbations of different intensities, one can find solutions to the linear initial-value problem that grow to arbitrarily large amplitude before decaying. One important question is whether this transient growth induces nonlinear behaviour in the full system before the disturbance can decay. From the numerical simulations, however, the system appears to be very robust, with a decay that is very similar to that predicted by the linear theory no matter how big the transient growth. In other words, the nonlinearity of the system does not provide any new, unexpected behaviours.

Second, we also applied Nyquist methods to the linear stability problem: in the linear analysis section we computed analytically the value of the growth rate and the critical value of the coupling parameter for the Lorentzian. For general $g(\omega)$, the integral in (8) can be solved numerically to furnish similar results, but in many situations it is helpful to have a quick, general understanding. We used the Nyquist criterion to derive the following result
Figure 11: $\rho(\theta, \omega, t)$ at six successive instants of time obtained via numerical integration of the Kuramoto discrete model with $D = 0.01$ and $K = 0.65$ ($K_c = 0.739$, sub-critical coupling). Both $\theta$ and
w  axes  are  discretized  in  16  intervals.  Compact  support  within  [ — 1;  1]  for  g{oS)  =  initial 

condition  p0  =  “  nnd  pi  =  4^7  with  f  =  0.1. 


Figure 12: The same as in figure (11) except $K = 0.8$ (super-critical coupling).

that, here, we only state: the maximum number of unstable growth rates for the linearized Kuramoto system is the number of monotonic pieces of $g(\omega)$ divided by two.

On the other hand, there is further work to be done in at least two different directions:

•  The comparison of the discrete and continuous models is only sketched and needs more numerical exploration. Moreover, do traveling solutions, as we saw in figure (12), exist in the continuous case? In figure (1) structures can be noticed, even though the coupling is sub-critical and we expect incoherence. What is the origin of these structures? Do they depend only on the initial condition?

•  The change in the form of the coupling seems to be critical (see, for example, [4] and [8]). What happens if the coupling is different from $\sin(\phi - \theta)$? Why?
Effect of a Simple Storm on a Simple Ocean


Jeff  Moehlis 


1  Introduction:  Near-Inertial  Oscillations  and  Storms 


Horizontal motion of a free particle on the Earth's surface subject only to the Coriolis force is governed by the equations

$$\frac{du_p}{dt} - f v_p = 0, \qquad \frac{dv_p}{dt} + f u_p = 0,$$

where $u_p$ and $v_p$ are respectively the eastward and northward components of the particle's velocity in the frame rotating with the Earth, $f = 2\Omega_e\sin\phi$ is the Coriolis parameter, $\Omega_e$ is the frequency of the Earth's rotation, and $\phi$ is the latitude (see, e.g., [4]). This has solution $u_p + i v_p = e^{-ift}(u_0 + i v_0)$, where $u_0$ and $v_0$ are the initial components of the velocity. This corresponds to the particle describing a circle of radius $(u_0^2 + v_0^2)^{1/2}/f$ with frequency $f$. In the northern hemisphere, $f > 0$ and the particle rotates in a clockwise direction when viewed from above. The inertial frequency $f$ is the low-frequency cutoff for internal waves in the ocean. An internal wave with frequency near $f$ is called a near-inertial oscillation (NIO). About half of the total kinetic energy associated with internal waves in the ocean is contained in NIOs [5].
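The inertial-circle solution is easy to evaluate directly. The sketch below is illustrative: the function names are introduced here, and the value used for the Earth's rotation rate ($7.2921\times10^{-5}$ rad/s) is a standard figure assumed for the example rather than one quoted in the text.

```python
import cmath
import math

EARTH_ROTATION_RATE = 7.2921e-5  # rad/s, assumed standard value


def coriolis_parameter(latitude_deg, rotation_rate=EARTH_ROTATION_RATE):
    """f = 2 * Omega_e * sin(latitude)."""
    return 2.0 * rotation_rate * math.sin(math.radians(latitude_deg))


def inertial_velocity(u0, v0, f, t):
    """Velocity (u, v) at time t from u + i v = exp(-i f t) (u0 + i v0)."""
    w = cmath.exp(-1j * f * t) * complex(u0, v0)
    return w.real, w.imag


def inertial_radius(u0, v0, f):
    """Radius of the inertial circle, speed divided by |f|."""
    return math.hypot(u0, v0) / abs(f)


f_mid = coriolis_parameter(45.0)
quarter_period = math.pi / (2.0 * f_mid)
print(f_mid, inertial_radius(0.1, 0.0, f_mid))
print(inertial_velocity(1.0, 0.0, f_mid, quarter_period))
```

Starting eastward with $f > 0$, the velocity points southward a quarter of an inertial period later, which is the clockwise rotation described above.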

There is much observational evidence, starting with [17, 14], that wind from storms can excite near-inertial currents in the mixed layer of the ocean; recent observations include [8, 13, 15]. Simple models which treat the mixed layer as a solid slab have been quite successful at explaining the process by which wind generates such currents (see, e.g., [14, 5]). These currents decay away after the storm passes, with possible mechanisms for the decay including nonlinear interactions which transfer energy to other frequencies [12], turbulent dissipation [11], and the radiation of downward propagating NIOs excited by inertial pumping into the interior of the ocean [10]. The last mechanism will be the focus of this paper. Such downward propagation of NIOs is believed to be a significant mechanism for mixing in the upper ocean.

Observations give a time scale for the decay of the energy deposited by the passing storm on the order of ten to twenty days [8, 13, 15]. This time scale is in contrast with estimates such as that by [10] that near-inertial currents decaying through the downward propagation of NIOs and with a horizontal length scale typical of the atmospheric forcing mechanism can remain in the mixed layer for longer than a year. To account for this difference, several mechanisms for the enhancement of vertical propagation of NIOs have been suggested (these are nicely summarized in [16]), including smaller-scale fluctuations within the storms, the $\beta$ effect [6], and interaction with background geostrophic or quasigeostrophic flow (see, e.g., [2, 3, 16]).
This paper considers the vertical propagation of near-inertial energy and shear deposited into the mixed layer by a storm in the presence of the $\beta$ effect. The analysis uses the formalism of [18], which is briefly discussed in Section 2. In Section 3, a simplified model with three main assumptions is outlined. First, the background flow is assumed to be independent of longitude and the associated vorticity is assumed to be zero. Second, the buoyancy frequency is taken to be approximately zero in the mixed layer, and constant in the interior (i.e., beneath the mixed layer). Third, it is assumed that the storm has moved very rapidly across the ocean and has created a horizontally uniform near-inertial current to the east concentrated within the mixed layer. Section 4 uses the fact that the depth of the ocean is very much larger than the mixed layer depth to formulate and solve the model for an ocean which is (effectively) infinitely deep. Section 5 discusses the results and suggests directions for further investigation.


2  Formalism 

Consider the ocean to be infinite in horizontal extent and of depth $D$, with the mixed layer being the portion of the ocean with $-H_{mix} < z < 0$, and the interior the portion with $-D < z < -H_{mix}$. The $x$ and $y$ axes are taken to point to the east and north, respectively. The buoyancy frequency $N = N(z)$ is an arbitrary piecewise continuous function of depth $z$.


2.1  Evolution  Equation 

Young and Ben Jelloul [18] derive an evolution equation for a complex field $A(x, y, z, t)$ from which leading-order NIO motion in the presence of a steady barotropic background flow and the $\beta$ effect can be deduced:

$$LA_t + \frac{\partial(\psi, LA)}{\partial(x, y)} + \frac{i}{2}f_0\nabla^2 A + i\left(\beta y + \frac{1}{2}\zeta\right)LA = 0, \qquad (1)$$

where

$$LA = \frac{\partial}{\partial z}\left(\frac{f_0^2}{N^2}\frac{\partial A}{\partial z}\right), \qquad (2)$$

$\psi$ is the streamfunction for the background flow, $\zeta = \nabla^2\psi$ is the associated vorticity, and the Coriolis parameter is $f = f_0 + \beta y$. Here $\nabla$ is the horizontal gradient, and $\nabla^2 = \partial_x^2 + \partial_y^2$. Subscripts denote partial differentiation. The asymptotic expansion used in the derivation of equation (1) relies upon the frequency of near-inertial waves being close to the inertial frequency $f_0$. The NIO velocity field $(u, v, w)$, buoyancy $b$, and pressure $p$ are given by

$$u + iv = e^{-if_0t}LA, \qquad (3)$$

$$w = -\frac{1}{2}f_0^2 N^{-2}(A_{xz} - iA_{yz})e^{-if_0t} + \text{c.c.},$$

$$b = \frac{i}{2}f_0(A_{xz} - iA_{yz})e^{-if_0t} + \text{c.c.},$$

$$p = \frac{i}{2}f_0(A_x - iA_y)e^{-if_0t} + \text{c.c.}$$

Here $b$ is related to the density $\rho$ by


P  =  Po 


b' 

9. 


5 


where $\rho_0$ is the reference density at the top of the ocean. Note that the pressure $p$ has been normalized by $\rho_0$.

The boundary conditions are that $w$ vanishes at the top and bottom of the ocean; this corresponds to the boundary condition $A_z = 0$ at $z = 0$ and $z = -D$. This boundary condition along with equation (3) implies that

$$\int_{-D}^{0}(u + iv)\,dz = 0. \qquad (4)$$

Thus, the barotropic motion is not included in the analysis; note that [10] shows that the barotropic response to a storm is instantaneous and the associated currents are weak.


2.2  Jump  Conditions 

Suppose that the buoyancy frequency is discontinuous at $z = z_d$. Integrating equations (2) and (3) from $z = z_d - \delta$ to $z = z_d + \delta$, the following jump condition is obtained:

$$\left[\frac{f_0^2}{N^2}A_z\right]_{z_d - \delta}^{z_d + \delta} = e^{if_0t}\int_{z_d - \delta}^{z_d + \delta}(u + iv)\,dz.$$

The right-hand side tends to zero as $\delta \to 0$ provided $u$ and $v$ remain finite (which must be true on physical grounds). Thus, $f_0^2 N^{-2}A_z$ is continuous, even when $N^2$ is discontinuous. Now assuming that $\psi$ and $\zeta$ have no $\delta$-function behavior in the $z$ direction, integrating equation (1) over the same interval in $z$ implies that

$$\lim_{\delta \to 0}\int_{z_d - \delta}^{z_d + \delta}\nabla^2 A\,dz = 0.$$

Thus $\nabla^2 A = A_{xx} + A_{yy}$ is continuous across $z = z_d$.


2.3  Energy  and  Shear 

The quantities

$$u^2 + v^2 = |LA|^2, \qquad u_z^2 + v_z^2 = |(LA)_z|^2 \qquad (5)$$

give local measures of the horizontal kinetic energy per unit mass (hereafter HKE) contained in near-inertial motion and the associated vertical shear, respectively. Using equation (1) and its complex conjugate,

$$\frac{\partial|LA|^2}{\partial t} = LA^*_t\,LA + LA_t\,LA^* = \frac{i}{2}f_0\nabla\cdot\left[LA\,\nabla A^* - LA^*\,\nabla A\right] - \frac{\partial(\psi, |LA|^2)}{\partial(x, y)} + \frac{i}{2}f_0\frac{\partial}{\partial z}\left[\frac{f_0^2}{N^2}\left(\nabla A^*_z\cdot\nabla A - \nabla A_z\cdot\nabla A^*\right)\right], \qquad (6)$$

where the star denotes complex conjugation. In the following, it will be useful to integrate this over a volume in order to determine how the energy contained in horizontal near-inertial motion in the volume depends on the value of derivatives of $A$ evaluated on the surface of the volume. A general equation for the evolution of the shear is not given here, but will be for the simplified model considered next.


3  A  Simplified  Model 

To simplify the analysis, it is assumed that $A$ and $\psi$ do not vary in the $x$ direction, and that $\zeta = 0$. The analysis thus keeps the $\beta$ effect but neglects the effect of background barotropic vorticity. The buoyancy frequency profile is taken to be

$$N^2 = \epsilon^2 N_0^2, \qquad -H_{mix} < z < 0,$$

$$N^2 = N_0^2, \qquad -D < z < -H_{mix},$$

where $\epsilon \ll 1$. Finally, the storm is assumed to produce the initial condition of a horizontally uniform near-inertial current to the east concentrated within the mixed layer.

Instead of approaching this problem by projecting onto normal modes (see, e.g., [2, 10]), the problem will be formulated as an initial value problem on a semi-infinite domain corresponding to an effectively infinitely deep ocean. In order to formulate the problem properly for this limit, this section considers an ocean of finite depth. In Section 4 the solution in the limit that the depth of the interior is much greater than the mixed layer depth will be found.


3.1  Nondimensionalization 


Quantities  are  nondimensionalized  according  to 


z  = 


Hr, 


+  1) 


t  =  fit, 


where 


Typical values $\beta \sim 10^{-11}\,\mathrm{m^{-1}s^{-1}}$, $H_{mix} \sim 100\,\mathrm{m}$, $f_0 \sim 10^{-4}\,\mathrm{s^{-1}}$, $N_0 \sim 10^{-2}\,\mathrm{s^{-1}}$ give $Y \sim 10^5\,\mathrm{m}$ and $\Omega \approx 10^{-6}\,\mathrm{s^{-1}}$. The relevant time scale is thus $1/\Omega \approx 11.5$ days. Also, with a view to specifying the initial velocity profile according to equation (3), the velocity and the field $A$ are nondimensionalized as

$$(\hat{u}, \hat{v}) = \frac{(u, v)}{U}, \qquad \hat{A} = \frac{f_0^2}{U N_0^2 H_{mix}^2}A,$$

where $U$ is a characteristic value of the velocity. The hats will be dropped for ease of notation. With this nondimensionalization, the buoyancy frequency profile is

$$N^2 = \epsilon^2, \qquad 0 < z < 1,$$

$$N^2 = 1, \qquad -H \equiv -\frac{D}{H_{mix}} + 1 < z < 0,$$

and equation (1), the boundary conditions, and initial condition become

$$A_{zzt} + \frac{i}{2}N^2 A_{yy} + iyA_{zz} = 0, \qquad (7)$$

$$A_z = 0, \qquad z = -H,\ z = 1, \qquad (8)$$

$$A_{zz} = N^2 u, \qquad t = 0. \qquad (9)$$

The jump conditions in nondimensional form are

$$A_z|_{z=0^+} = \epsilon^2 A_z|_{z=0^-}, \qquad A_{yy}|_{z=0^+} = A_{yy}|_{z=0^-}, \qquad (10)$$

where $z = 0^+$ and $z = 0^-$ are the limits as $z \to 0$ from positive and negative $z$ values, respectively.

This nondimensionalization allows some immediate conclusions to be drawn about the propagation of NIO energy and shear downwards. Most importantly, if $H_{mix}$ increases then the timescale $1/\Omega$ decreases. Thus, assuming that the storm causes a uniform near-inertial current throughout the whole mixed layer, energy and shear transfer will be faster for a deeper mixed layer. This confirms the results of [10], which associated the more efficient transfer with a larger projection of the initial velocity profile on the first vertical mode.
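The dependence of the time scale on the mixed-layer depth can be illustrated numerically. The sketch below assumes the scales $Y = (H_{mix}^2 N_0^2/(\beta f_0))^{1/3}$ and $\Omega = \beta Y$; these definitions are an assumption made here because they reproduce the typical magnitudes quoted earlier ($Y \sim 10^5$ m, $1/\Omega \approx 11.5$ days), and the function names are hypothetical.

```python
SECONDS_PER_DAY = 86400.0


def horizontal_scale(beta, h_mix, f0, n0):
    """Assumed horizontal scale Y = (H_mix^2 N_0^2 / (beta f_0))^(1/3), in m."""
    return (h_mix ** 2 * n0 ** 2 / (beta * f0)) ** (1.0 / 3.0)


def frequency_scale(beta, h_mix, f0, n0):
    """Assumed frequency scale Omega = beta * Y, in 1/s."""
    return beta * horizontal_scale(beta, h_mix, f0, n0)


def time_scale_days(beta, h_mix, f0, n0):
    """Time scale 1/Omega expressed in days."""
    return 1.0 / (frequency_scale(beta, h_mix, f0, n0) * SECONDS_PER_DAY)


typical = dict(beta=1e-11, f0=1e-4, n0=1e-2)
for depth in (50.0, 100.0, 200.0):
    print(depth, time_scale_days(h_mix=depth, **typical))
```

Under these assumed definitions $\Omega \propto H_{mix}^{2/3}$, so doubling the mixed-layer depth shortens the time scale by a factor of about $2^{2/3} \approx 1.6$.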


3.2  Integral  Energy  Relations  and  Energy  Flux 

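The nondimensional local kinetic energy per unit mass is $u^2 + v^2 = |A_{zz}/N^2|^2$. The nondimensional form of equation (6), with the assumptions of the simplified model, is

$$\frac{\partial}{\partial t}\left|\frac{A_{zz}}{N^2}\right|^2 = \frac{i}{2N^2}\frac{\partial}{\partial y}\left(A_{zz}A^*_y - A^*_{zz}A_y\right) + \frac{i}{2N^2}\frac{\partial}{\partial z}\left(A^*_{yz}A_y - A_{yz}A^*_y\right). \qquad (11)$$

Let

$$\int_{INT}dV \equiv \int_{-H}^{0}dz\int_{-\infty}^{\infty}dx\int_{-\infty}^{\infty}dy, \qquad \int_{ML}dV \equiv \int_{0}^{1}dz\int_{-\infty}^{\infty}dx\int_{-\infty}^{\infty}dy$$

be the integrals over the interior of the ocean and the mixed layer, respectively. Assuming $A_{zz}A^*_y - A^*_{zz}A_y$ vanishes for $|y| \to \infty$ and using equation (8) gives the following results:

$$\frac{d}{dt}\int_{INT}|A_{zz}|^2\,dV = \frac{i}{2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\left(A^*_{yz}A_y - A_{yz}A^*_y\right)\Big|_{z=0^-}dx\,dy, \qquad (12)$$

$$\frac{d}{dt}\int_{ML}\left|\frac{A_{zz}}{\epsilon^2}\right|^2dV = -\frac{i}{2\epsilon^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\left(A^*_{yz}A_y - A_{yz}A^*_y\right)\Big|_{z=0^+}dx\,dy. \qquad (13)$$

Equations (10), (12) and (13) may be combined to give

$$\frac{d}{dt}\int_{INT}|A_{zz}|^2\,dV + \frac{d}{dt}\int_{ML}\left|\frac{A_{zz}}{\epsilon^2}\right|^2dV = 0. \qquad (14)$$

This is a statement of conservation of HKE in nondimensional form.

The quantity $F_E(y, t) = \frac{i}{2}(A^*_{yz}A_y - A_{yz}A^*_y)|_{z=0^-}$ is the flux of HKE from the mixed layer to the interior of the ocean. Letting

$$\int_{V_d}dV \equiv \int_{-H}^{-d}dz\int_{-\infty}^{\infty}dx\int_{-\infty}^{\infty}dy,$$

similar arguments show that

$$\frac{d}{dt}\int_{V_d}|A_{zz}|^2\,dV = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty}F_E(y, t; d)\,dx\,dy, \qquad (15)$$

where

$$F_E(y, t; d) = \frac{i}{2}\left(A^*_{yz}A_y - A_{yz}A^*_y\right)\Big|_{z=-d} \qquad (16)$$

gives the flux of HKE from the region $z > -d$ to the region $z < -d$. Equation (12) is a special case of equation (15) with $F_E(y, t) = F_E(y, t; 0^-)$.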

3.3  Integral  Shear  Relations  and  Shear  Flux 

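In Section 3.4, it will be shown that, to leading order in $\epsilon$, $A$ is independent of $z$ in the mixed layer, and thus there is no shear in the mixed layer. For $z < 0$ the nondimensionalized buoyancy frequency is $N = 1$, so the vertical shear from equation (5) may be written in nondimensional form as $u_z^2 + v_z^2 = |A_{zzz}|^2$. Similar arguments to those leading to equations (6) and (11) give the evolution equation

$$\frac{\partial}{\partial t}|A_{zzz}|^2 = \frac{i}{2}\frac{\partial}{\partial y}\left(A_{zzz}A^*_{yz} - A^*_{zzz}A_{yz}\right) + \frac{i}{2}\frac{\partial}{\partial z}\left(A^*_{yzz}A_{yz} - A_{yzz}A^*_{yz}\right).$$

Assuming $A_{zzz}A^*_{yz} - A^*_{zzz}A_{yz}$ vanishes for $|y| \to \infty$,

$$\frac{d}{dt}\int_{V_d}|A_{zzz}|^2\,dV = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty}F_S(y, t; d)\,dx\,dy,$$

where

$$F_S(y, t; d) = \frac{i}{2}\left(A^*_{yzz}A_{yz} - A_{yzz}A^*_{yz}\right)\Big|_{z=-d} \qquad (17)$$

is the flux of vertical shear from the region $z > -d$ to the region $z < -d$.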
3.4  Boundary Condition at Base of Mixed Layer

For $0 < z < 1$, equation (7) becomes

$$A_{zzt} + \frac{i}{2}\epsilon^2 A_{yy} + iyA_{zz} = 0.$$

Expanding $A(y, z, t) = A_0(y, z, t) + \epsilon^2 A_2(y, z, t) + O(\epsilon^4)$,

$$A_{0zzt} + iyA_{0zz} = 0.$$

Integrating this subject to the boundary condition that $A_z$ and thus $A_{0z}$ vanishes at $z = 1$ gives

A,  =  e~iyt  f  giyA'yy1' dt' 

Jo 

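for some function $g$. In particular, $A_0$ is independent of $z$. At $O(\epsilon^2)$,

$$A_{2zzt} + iyA_{2zz} + \frac{i}{2}A_{0yy} = 0, \qquad (18)$$

which may be integrated subject to the boundary condition that $A_{2z}$ vanishes at $z = 1$ to give

$$A_{2zt} + iyA_{2z} + \frac{i}{2}A_{0yy}(z - 1) = 0.$$

Evaluating at $z = 0^+$ and using

$$A_{yy} = A_{0yy} + O(\epsilon^2), \qquad A_z = \epsilon^2 A_{2z} + O(\epsilon^4), \qquad A_{zt} = \epsilon^2 A_{2zt} + O(\epsilon^4)$$

implies that

$$A_{zt} + iyA_z - \frac{i}{2}\epsilon^2 A_{yy} = O(\epsilon^4), \qquad z = 0^+.$$

Finally, applying (10) gives the boundary condition

$$A_{zt} + iyA_z - \frac{i}{2}A_{yy} = 0, \qquad z = 0^- \qquad (19)$$

to leading order in $\epsilon$.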

3.5  Initial  Condition 

Suppose that in a short time compared with the NIO wave propagation time the passing storm causes near-inertial currents in the mixed layer with no horizontal structure. For simplicity, the initial velocity is assumed to be piecewise constant with depth. Thus the initial velocity profile (consistent with equation (4)) is taken to be

$$u = 1, \qquad 0 < z < 1,$$

$$u = -\frac{1}{H}, \qquad -H < z < 0,$$

$$v = 0, \qquad -H < z < 1.$$

Integrating equation (9) with respect to $z$ and using the boundary conditions (8) then gives at $t = 0$

$$A_z = \epsilon^2(z - 1), \qquad 0 < z < 1, \qquad (20)$$

$$A_z = -\frac{z + H}{H}, \qquad -H < z < 0. \qquad (21)$$

4  Solution  for  an  Infinitely  Deep  Ocean 

The total depth of the ocean is typically on the order of a hundred times the depth of the mixed layer; thus, the limit of infinite depth is considered. The initial condition is taken to be equation (21) with $H \to \infty$. The boundary condition for $z \to -\infty$ is taken to be $A_{zz} \to 0$, corresponding to the near-inertial velocities vanishing at infinite depth (see equation (3)). This limit does not invalidate the use of equation (1), which assumed hydrostatic balance and thus holds for an ocean with depth much smaller than the horizontal scales. The ocean still in reality has finite depth, but for depths just below the mixed layer it is effectively infinitely deep. Of course, this limit excludes the possibility of reflections off the bottom of the ocean which may be important (see, e.g., [9]); thus, the results should be viewed as what would happen in the absence of such reflections. Finally, the boundary condition for $z = 0^-$ given by equation (19) is used. For convenience, the problem to be solved for the semi-infinite domain $z < 0$ is summarized:

$$A_{zzt} + \frac{i}{2}A_{yy} + iyA_{zz} = 0, \qquad z < 0,$$

$$A_{zt} + iyA_z - \frac{i}{2}A_{yy} = 0, \qquad z = 0^-,$$

$$A_{zz} \to 0, \qquad z \to -\infty,$$

$$A_z = -1, \qquad t = 0.$$

4.1  Solution  by  Laplace  Transforms 

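Making the ansatz and definitions

$$A(y,z,t) = e^{-iyt}B(z,t), \qquad T = \frac{t^3}{3}, \qquad B(z,T) \equiv B(z,t), \tag{22}$$

implies that

$$B_{zzT} - \frac{i}{2}B = 0, \qquad z < 0, \tag{23}$$

$$B_{zT} + \frac{i}{2}B = 0, \qquad z = 0^-, \tag{24}$$

$$B_{zz} \to 0, \qquad z \to -\infty, \tag{25}$$

$$B_z = -1, \qquad T = 0. \tag{26}$$

Laplace transforming equations (23)-(25) in time gives

$$b(z,p) = \mathcal{L}[B(z,T)] = \int_0^\infty B(z,T)e^{-pT}\,dT, \tag{27}$$

$$p\,b_{zz} - B_{zz}(z,0) - \frac{i}{2}b = 0, \qquad z < 0, \tag{28}$$

$$b_{zz} \to 0, \qquad z \to -\infty, \tag{29}$$

$$p\,b_z - B_z(z,0) + \frac{i}{2}b = 0, \qquad z = 0^-. \tag{30}$$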

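$B_{zz}(z,0) = 0$ from equation (26); thus the solution to equation (28) satisfying the boundary
condition (29) is

$$b(z,p) = f(p)\exp\left(\frac{\alpha z}{\sqrt{p}}\right), \qquad \alpha = \frac{1}{\sqrt{2}}e^{i\pi/4} = \frac{1}{2}(1 + i).$$

Using boundary condition (30) with $B_z(z,0) = -1$ from (26) determines f(p), giving

$$b(z,p) = -\frac{1}{\alpha(\sqrt{p} + \alpha)}\exp\left(\frac{\alpha z}{\sqrt{p}}\right). \tag{31}$$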

In principle, the problem is solved at this stage; inverting this Laplace transform gives B(z,T),
then A(y,z,t) is obtained from equation (22). This can then be differentiated in order to
determine various quantities of interest. In practice, it is more convenient to first differentiate
with respect to z as appropriate, and then to invert the transform. This inversion may be
done in three different ways: first, for $z = 0^-$ analytical expressions can be found, and these
can be used to find analytical expressions for 0 < z < 1; second, for other z values the
inversion may be done numerically; and third, the asymptotic behavior can be determined
by the method of steepest descents. But first, expressions for the flux of energy and shear in
terms of the field B(z,t) are given.
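As a quick sanity check on equation (31), the following sketch evaluates b(z,p) at an arbitrary test point and confirms by finite differences that it satisfies equations (28) and (30). The test values of z and p, the step size, and the function names are illustrative assumptions and are not part of the original analysis.

```python
import cmath
import math

# alpha = exp(i*pi/4)/sqrt(2) = (1 + i)/2, so that alpha**2 = i/2
ALPHA = cmath.exp(1j * math.pi / 4) / math.sqrt(2)

def b(z, p):
    """Transformed solution, equation (31), using the principal square root."""
    sp = cmath.sqrt(p)
    return -cmath.exp(ALPHA * z / sp) / (ALPHA * (sp + ALPHA))

def d1(f, z, h=1e-4):
    """Central-difference first derivative in z."""
    return (f(z + h) - f(z - h)) / (2 * h)

def d2(f, z, h=1e-4):
    """Central-difference second derivative in z."""
    return (f(z + h) - 2 * f(z) + f(z - h)) / h**2

p = 0.8 + 0.3j            # arbitrary test value with Re(p) > 0 (assumption)
f = lambda z: b(z, p)

# Equation (28) with B_zz(z, 0) = 0:  p*b_zz - (i/2)*b = 0 for z < 0
interior_residual = p * d2(f, -0.7) - 0.5j * f(-0.7)

# Equation (30) with B_z(z, 0) = -1:  p*b_z + 1 + (i/2)*b = 0 at z = 0
boundary_residual = p * d1(f, 0.0) + 1 + 0.5j * f(0.0)

print(abs(interior_residual), abs(boundary_residual))
```

Both residuals should be small compared with unity, limited only by the finite-difference error.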

4.2  Flux  of  Energy  and  Shear 


The energy and shear fluxes given in equations (16) and (17) may be related to the field B
using equation (22), giving

$$F_E(t;d) = \frac{i}{2}t^2\left(B_z^* B - B_z B^*\right)\Big|_{z=-d} \tag{32}$$

$$\phantom{F_E(t;d)} = t^2\left[\mathrm{Im}(B_z)\mathrm{Re}(B) - \mathrm{Re}(B_z)\mathrm{Im}(B)\right]\Big|_{z=-d}, \tag{33}$$

$$F_S(t;d) = \frac{i}{2}t^2\left(B_{zz}^* B_z - B_{zz} B_z^*\right)\Big|_{z=-d} \tag{34}$$

$$\phantom{F_S(t;d)} = t^2\left[\mathrm{Im}(B_{zz})\mathrm{Re}(B_z) - \mathrm{Re}(B_{zz})\mathrm{Im}(B_z)\right]\Big|_{z=-d}. \tag{35}$$

These fluxes are independent of y so the dependence on this variable is suppressed. Also,
initially $A_{zz} = 0$ for z < 0. Integrating equation (15) with respect to time then gives, for
the HKE contained in the region z < -d,

$$\int_{-\infty}^{\infty} dx \int_{-\infty}^{\infty} dy \int_0^t F_E(t';d)\,dt'.$$

Thus

$$E(t;d) = \int_0^t F_E(t';d)\,dt' \tag{36}$$

is the total amount of HKE which has penetrated into the region z < -d. Calculating the
initial amount of HKE in the mixed layer for the initial velocity profile shows that $E(t;d) \to 1$
corresponds to all energy originally in the mixed layer having reached depths below z = -d.
The quantity

$$S(t;d) = \int_0^t F_S(t';d)\,dt' \tag{37}$$

is the integrated shear flux which has penetrated into the region z < -d. Note that the initial
value of the shear for z < 0 and z > 0 is zero, but the total initial shear is infinite because of
the discontinuity in the initial velocity profile at z = 0.
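The step from (32) to (33), and likewise from (34) to (35), uses only an elementary identity for complex numbers. Written out as a short worked step (added here for clarity, not taken from the original text):

```latex
B_z^{*}B - B_z B^{*} = \left(B_z B^{*}\right)^{*} - B_z B^{*} = -2i\,\mathrm{Im}\!\left(B_z B^{*}\right),
\qquad
\mathrm{Im}\!\left(B_z B^{*}\right) = \mathrm{Im}(B_z)\,\mathrm{Re}(B) - \mathrm{Re}(B_z)\,\mathrm{Im}(B),
```

so that $\frac{i}{2}(-2i)\,\mathrm{Im}(B_z B^*) = \mathrm{Im}(B_z B^*)$. The shear flux follows from the same identity with B replaced by $B_z$ and $B_z$ replaced by $B_{zz}$.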


4.3  Analytical  Solution  for  z  =  0 

For $z = 0^-$, inverse Laplace transforms are found in or deduced from a table in [1]. From
equation (31),

$$b(0^-,p) = -\frac{1}{\alpha}\,\frac{1}{\sqrt{p} + \alpha},$$

and one obtains

$$B(0^-,T) = \mathcal{L}^{-1}[b(0^-,p)] = -\frac{1}{\alpha\sqrt{\pi T}} + e^{\alpha^2 T}\,\mathrm{erfc}(\alpha\sqrt{T}).$$

This is converted to the original time t using equation (22):

$$B(0^-,t) = -\frac{\sqrt{3}}{\alpha\sqrt{\pi}\,t^{3/2}} + e^{\alpha^2 t^3/3}\,\mathrm{erfc}\left(\frac{\alpha}{\sqrt{3}}t^{3/2}\right).$$

Differentiating equation (31) with respect to z and evaluating at $z = 0^-$ gives

$$b_z(0^-,p) = -\frac{1}{p + \alpha\sqrt{p}} \;\Rightarrow\; B_z(0^-,t) = -e^{\alpha^2 t^3/3}\,\mathrm{erfc}\left(\frac{\alpha}{\sqrt{3}}t^{3/2}\right), \tag{38}$$

$$b_{zz}(0^-,p) = -\frac{\alpha}{p(\sqrt{p} + \alpha)} \;\Rightarrow\; B_{zz}(0^-,t) = e^{\alpha^2 t^3/3}\,\mathrm{erfc}\left(\frac{\alpha}{\sqrt{3}}t^{3/2}\right) - 1, \tag{39}$$

$$b_{zzz}(0^-,p) = -\frac{\alpha^2}{p(p + \alpha\sqrt{p})} \;\Rightarrow\; B_{zzz}(0^-,t) = 1 - e^{\alpha^2 t^3/3}\,\mathrm{erfc}\left(\frac{\alpha}{\sqrt{3}}t^{3/2}\right) - \frac{2\alpha}{\sqrt{3\pi}}t^{3/2}. \tag{40}$$

The solid lines (labelled d = 0) in Figure 1 show the quantities $F_E$, $F_S$, E, and S calculated
using the results of this section. $F_E$ peaks at the nondimensionalized time t ≈ 0.62; for the
typical values quoted in Section 3.1, this corresponds to about a week after the storm. From
Figure 1(b) and using the fact that whatever energy flows through $z = 0^-$ must have initially
been in the mixed layer, we see that by t = 1 (approximately 11.5 days after the storm)
nearly half of the energy associated with horizontal NIO currents caused by the storm has
left the mixed layer. By t = 2 (approximately 23 days after the storm) 82% has left, while
by t = 3 (just over a month after the storm) 93% has left. Although this simplified model
cannot be expected to capture the full complexity of a real storm over the ocean, it does give
reasonable estimates for the time scale on which the decay of NIO energy occurs: [8] found
that the mixed layer inertial energy was reduced to background levels by 21 days after the
storm. Both the shear flux $F_S$ and the integrated shear S increase monotonically with time,
an artifact of the initial velocity profile being discontinuous at z = 0.


Figure 1: (a) $F_E(t;d)$, (b) $E(t;d)$, (c) $F_S(t;d)$, and (d) $S(t;d)$ for different distances d
below the base of the mixed layer. These show instantaneous and integrated fluxes of energy and
shear; see the text for precise definitions. The solid lines (labelled d = 0) give results at
$z = 0^-$.
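The transform pairs used in Section 4.3 can be checked numerically. The sketch below does this for a real parameter a, whereas the text uses the complex value $\alpha$; the choice of a and p, the quadrature settings, and all function names are assumptions made for illustration only. The substitution $T = s^2$ removes the square-root behavior of the integrands at T = 0.

```python
import math

def simpson(f, a, b, n):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def scaled_erfc(x):
    """exp(x^2) * erfc(x) for moderate real x >= 0."""
    return math.exp(x * x) * math.erfc(x)

def laplace(two_s_F, p, s_max=8.0, n=8000):
    """Laplace transform at p using T = s^2, dT = 2 s ds.

    two_s_F(s) must return 2*s*F(s^2) for the original function F(T).
    """
    return simpson(lambda s: two_s_F(s) * math.exp(-p * s * s), 0.0, s_max, n)

a, p = 0.7, 1.3           # arbitrary real test values (assumption)
rp = math.sqrt(p)

# 1/(sqrt(p) + a)              <->  1/sqrt(pi T) - a exp(a^2 T) erfc(a sqrt(T))
pair1 = laplace(lambda s: 2 / math.sqrt(math.pi) - 2 * a * s * scaled_erfc(a * s), p)
# 1/(sqrt(p) (sqrt(p) + a))    <->  exp(a^2 T) erfc(a sqrt(T))
pair2 = laplace(lambda s: 2 * s * scaled_erfc(a * s), p)
# a/(p (sqrt(p) + a))          <->  1 - exp(a^2 T) erfc(a sqrt(T))
pair3 = laplace(lambda s: 2 * s * (1 - scaled_erfc(a * s)), p)

print(pair1 - 1 / (rp + a))
print(pair2 - 1 / (rp * (rp + a)))
print(pair3 - a / (p * (rp + a)))
```

Each printed difference should be close to zero; the results for complex $\alpha$ then follow by analytic continuation in a.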


4.4  Analytical  Solution  for  the  Mixed  Layer 

Expanding $B(z,t) = B_0(z,t) + \epsilon^2 B_2(z,t) + O(\epsilon^4)$ for the interval 0 < z < 1,

$$A_0(y,z,t) = e^{-iyt}B_0(z,t) \equiv e^{-iyt}B_0(z,T),$$

$$A_2(y,z,t) = e^{-iyt}B_2(z,t) \equiv e^{-iyt}B_2(z,T),$$

where $T = t^3/3$ as before. Equation (18) then implies that

$$B_{2zzT} - \frac{i}{2}B_0 = 0, \qquad 0 < z < 1.$$

This is Laplace transformed to give

$$p\,b_{2zz} - B_{2zz}(z,0) - \frac{i}{2}b_0 = 0, \tag{41}$$

where $b_2 = \mathcal{L}[B_2]$ and $b_0 = \mathcal{L}[B_0]$. The initial condition within the mixed
layer is $A_{zz} = \epsilon^2$, so $A_{2zz} = 1$ at t = 0; thus $B_{2zz}(z,0) = 1$. Now, A is
continuous across z = 0, and $A = A_0$ to leading order in $\epsilon$. This implies that
$B_0(0^+,T) = B(0^-,T)$ to leading order in $\epsilon$. Also, $A_0$ and hence $B_0$ are independent
of z. Laplace transforming, we conclude that the value of $b_0$ for 0 < z < 1 is equal to
$b(0^-,p)$. Using all of this and equation (31) evaluated at $z = 0^-$ in equation (41),

$$b_{2zz} = \frac{1}{p} - \frac{i}{2p}\,\frac{1}{\alpha(\sqrt{p} + \alpha)} = \frac{1}{\sqrt{p}(\sqrt{p} + \alpha)} \;\Rightarrow\; B_{2zz} = e^{\alpha^2 t^3/3}\,\mathrm{erfc}\left(\frac{\alpha}{\sqrt{3}}t^{3/2}\right).$$

Thus,

$$A_{2zz}(y,t) = e^{-iyt}\,e^{\alpha^2 t^3/3}\,\mathrm{erfc}\left(\frac{\alpha}{\sqrt{3}}t^{3/2}\right). \tag{42}$$

The local HKE is, to leading order, $|A_{2zz}|^2 = |A_{zz}/\epsilon^2|^2$; the total HKE contained
within the mixed layer is

$$\int |A_{2zz}|^2\,dV.$$

Using $\alpha^2 = i/2$, the amount of HKE per unit volume within the mixed layer is

$$e_{ML} = \left|\mathrm{erfc}\left(\frac{\alpha}{\sqrt{3}}t^{3/2}\right)\right|^2. \tag{43}$$

From [1],

$$e^{\alpha^2 t^3/3}\,\mathrm{erfc}\left(\frac{\alpha}{\sqrt{3}}t^{3/2}\right) \sim \sqrt{\frac{3}{\pi}}\,\frac{1}{\alpha t^{3/2}}, \qquad t \to \infty,$$

$$\Rightarrow\; e_{ML} \sim \frac{6}{\pi t^3}, \qquad t \to \infty.$$

This asymptotic relationship is confirmed in Figure 2. Since $A_{2zz}$ is independent of z, to
leading order in $\epsilon$ the shear within the mixed layer is zero.

Figure 2: $e_{ML}$ gives the HKE per unit volume in the mixed layer. The solid line shows
the exact result and the dashed line shows the asymptotic result, shown both for linear and
logarithmic axes.
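Equation (43) and its large-time limit can be evaluated with a few lines of code. The sketch below is illustrative only: it computes the complex complementary error function from the integral representation $e^{x^2}\mathrm{erfc}(x) = (2/\sqrt{\pi})\int_0^\infty e^{-s^2 - 2xs}\,ds$ with a simple Simpson rule, which is an assumed numerical approach and not the method used to produce Figure 2. Function names and quadrature settings are hypothetical.

```python
import cmath
import math

ALPHA = cmath.exp(1j * math.pi / 4) / math.sqrt(2)   # alpha**2 = i/2

def scaled_erfc(x, s_max=8.0, n=8000):
    """exp(x^2) erfc(x) for complex x with Re(x) >= 0, by Simpson quadrature of
    (2/sqrt(pi)) * integral_0^inf exp(-s^2 - 2 x s) ds."""
    h = s_max / n
    f = lambda s: cmath.exp(-s * s - 2 * x * s)
    total = f(0.0) + f(s_max)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(k * h)
    return 2 / math.sqrt(math.pi) * total * h / 3

def e_ml(t):
    """HKE per unit volume in the mixed layer, equation (43).

    With x = alpha t^{3/2}/sqrt(3), x^2 is purely imaginary, so exp(x^2) has
    unit modulus and |erfc(x)| = |exp(x^2) erfc(x)|.
    """
    x = ALPHA * t**1.5 / math.sqrt(3)
    return abs(scaled_erfc(x)) ** 2

for t in (1.0, 2.0, 3.0, 6.0):
    fraction_left = 1 - e_ml(t)                    # energy that has left the mixed layer
    ratio = e_ml(t) * math.pi * t**3 / 6           # exact / asymptotic
    print(t, fraction_left, ratio)
```

The fraction of energy that has left the mixed layer can be compared with the values quoted from Figure 1(b), and the last column should approach unity as t grows.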


4.5  Solution  for  Other  Depths 

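A tractable analytical form for the inverse Laplace transform of equation (31) for z ≠ 0 could
not be found; however, it may be inverted numerically as described in this section. The
inverse Laplace transform is given by

$$B(z,T) = -\frac{1}{2\pi\alpha i}\int_{\mathcal{B}} \frac{\exp\left(\frac{\alpha z}{\sqrt{p}} + pT\right)}{\sqrt{p} + \alpha}\,dp,$$

where $\mathcal{B}$ is the Bromwich contour. Here the branch cut for the square root is taken to be
along the negative real axis, and the principal branch is chosen. The integrand does not have
any poles on this sheet of the Riemann surface. Such a pole would satisfy $\sqrt{p} + \alpha = 0$, and
would be given by $p_p = \alpha^2 = e^{i\pi/2}/2$; however, $\sqrt{p_p} + \alpha = \sqrt{2}e^{i\pi/4} \neq 0$
and thus there is no pole. It is useful to make the change of variables (valid for z ≠ 0 and T ≠ 0)

$$p = \left(-\frac{z}{T}\right)^{2/3} w = \eta^2 w, \tag{44}$$

$$\eta = \left(-\frac{z}{T}\right)^{1/3}, \qquad \xi = \eta^2 T = (-z)^{2/3}T^{1/3}. \tag{45}$$

Then

$$B(z,T) = -\frac{\eta^2}{2\pi\alpha i}\int_{\mathcal{B}} \frac{\exp\left[\xi\left(-\frac{\alpha}{\sqrt{w}} + w\right)\right]}{\eta\sqrt{w} + \alpha}\,dw. \tag{46}$$

Defining the contour C as in Figure 3 and using the facts that the contributions from $C_1$ and
$C_2$ vanish and that there are no poles,

$$\oint_C g(w;\xi,\eta)\,dw = \left\{\int_{\mathcal{B}} + \int_{AB} + \int_{BC} + \int_{CD}\right\} g(w;\xi,\eta)\,dw = 0,$$

where g denotes the integrand of equation (46). For the path AB, $w = re^{i\pi}$ and

$$I_{AB} = \int_{AB} g(w;\xi,\eta)\,dw = \int_1^\infty \frac{e^{-r\xi}e^{i\alpha\xi/\sqrt{r}}}{\eta i\sqrt{r} + \alpha}\,dr.$$

Figure 3: Contour used to determine inverse Laplace transformations. Here A = (-∞, 0+),
B = (-1, 0+), C = (-1, 0-), and D = (-∞, 0-).

For the path BC, $w = e^{i\theta}$ and

$$I_{BC} = \int_{BC} g(w;\xi,\eta)\,dw = -i\int_{-\pi}^{\pi} \frac{\exp\left[\xi\left(-\alpha e^{-i\theta/2} + e^{i\theta}\right)\right]e^{i\theta}}{\eta e^{i\theta/2} + \alpha}\,d\theta.$$

Finally, for the path CD, $w = re^{-i\pi}$ and

$$I_{CD} = \int_{CD} g(w;\xi,\eta)\,dw = -\int_1^\infty \frac{e^{-r\xi}e^{-i\alpha\xi/\sqrt{r}}}{-\eta i\sqrt{r} + \alpha}\,dr.$$

Specifying z and T fixes $\xi$ and $\eta$. The integrals $I_{AB}$, $I_{BC}$, and $I_{CD}$ are
well-behaved and can be calculated numerically. Then

$$B(z,T) = \frac{\eta^2}{2\pi\alpha i}\left(I_{AB} + I_{BC} + I_{CD}\right).$$

Differentiating (31) with respect to z,

$$b_z(z,p) = -\frac{1}{p + \alpha\sqrt{p}}\exp\left(\frac{\alpha z}{\sqrt{p}}\right),$$

$$b_{zz}(z,p) = -\frac{\alpha}{p^{3/2} + \alpha p}\exp\left(\frac{\alpha z}{\sqrt{p}}\right),$$

$$b_{zzz}(z,p) = -\frac{\alpha^2}{p^2 + \alpha p^{3/2}}\exp\left(\frac{\alpha z}{\sqrt{p}}\right).$$

Using the contour in Figure 3, we find

$$B_z(z,T) = \frac{\eta^2}{2\pi i}\left(I^{(1)}_{AB} + I^{(1)}_{BC} + I^{(1)}_{CD}\right),$$

$$B_{zz}(z,T) = \frac{\alpha\eta^2}{2\pi i}\left(I^{(2)}_{AB} + I^{(2)}_{BC} + I^{(2)}_{CD}\right),$$

$$B_{zzz}(z,T) = \frac{\alpha^2\eta^2}{2\pi i}\left(I^{(3)}_{AB} + I^{(3)}_{BC} + I^{(3)}_{CD}\right),$$

where

$$I^{(1)}_{AB} = \int_1^\infty \frac{e^{-r\xi}e^{i\alpha\xi/\sqrt{r}}}{-\eta^2 r + \eta\alpha i\sqrt{r}}\,dr, \qquad
I^{(1)}_{BC} = -i\int_{-\pi}^{\pi} \frac{\exp\left[\xi\left(-\alpha e^{-i\theta/2} + e^{i\theta}\right)\right]}{\eta^2 + \eta\alpha e^{-i\theta/2}}\,d\theta,$$

$$I^{(1)}_{CD} = \int_1^\infty \frac{e^{-r\xi}e^{-i\alpha\xi/\sqrt{r}}}{\eta^2 r + \eta\alpha i\sqrt{r}}\,dr, \qquad
I^{(2)}_{AB} = -\int_1^\infty \frac{e^{-r\xi}e^{i\alpha\xi/\sqrt{r}}}{\eta^3 i r^{3/2} + \eta^2\alpha r}\,dr,$$

$$I^{(2)}_{BC} = -i\int_{-\pi}^{\pi} \frac{\exp\left[\xi\left(-\alpha e^{-i\theta/2} + e^{i\theta}\right)\right]}{\eta^3 e^{i\theta/2} + \eta^2\alpha}\,d\theta, \qquad
I^{(2)}_{CD} = \int_1^\infty \frac{e^{-r\xi}e^{-i\alpha\xi/\sqrt{r}}}{-\eta^3 i r^{3/2} + \eta^2\alpha r}\,dr,$$

$$I^{(3)}_{AB} = \int_1^\infty \frac{e^{-r\xi}e^{i\alpha\xi/\sqrt{r}}}{\eta^4 r^2 - \eta^3\alpha i r^{3/2}}\,dr, \qquad
I^{(3)}_{BC} = -i\int_{-\pi}^{\pi} \frac{\exp\left[\xi\left(-\alpha e^{-i\theta/2} + e^{i\theta}\right)\right]}{\eta^4 e^{i\theta} + \alpha\eta^3 e^{i\theta/2}}\,d\theta,$$

$$I^{(3)}_{CD} = -\int_1^\infty \frac{e^{-r\xi}e^{-i\alpha\xi/\sqrt{r}}}{\eta^4 r^2 + \alpha\eta^3 i r^{3/2}}\,dr.$$

All of these integrals are well-behaved and can be calculated numerically.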
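A minimal sketch of the numerical inversion described in this section is given below. It evaluates the three path integrals for B and $B_z$ with a composite Simpson rule and then checks that $B_z$ agrees with a finite-difference derivative of B. The quadrature rule, the truncation of the semi-infinite integrals, the test point (z, T) = (-1, 1), and all function names are assumptions made for illustration; the original text does not say which quadrature was used.

```python
import cmath
import math

ALPHA = cmath.exp(1j * math.pi / 4) / math.sqrt(2)

def simpson(f, a, b, n=4000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def contour_sum(denom, z, T):
    """Return (eta, I_AB + I_BC + I_CD) for exp[xi*h(w)] / denom(w, sqrt(w), eta)."""
    eta = (-z / T) ** (1.0 / 3.0)
    xi = eta**2 * T
    g = lambda w, sw: cmath.exp(xi * (w - ALPHA / sw)) / denom(w, sw, eta)
    r_max = 1.0 + 40.0 / xi          # exp(-r*xi) is negligible beyond this point
    # AB: upper side of the cut, w = -r, sqrt(w) = +i sqrt(r), r from infinity to 1
    i_ab = simpson(lambda r: g(-r, 1j * math.sqrt(r)), 1.0, r_max)
    # BC: unit circle, w = exp(i theta), theta from pi to -pi
    i_bc = -simpson(
        lambda th: g(cmath.exp(1j * th), cmath.exp(0.5j * th)) * 1j * cmath.exp(1j * th),
        -math.pi, math.pi)
    # CD: lower side of the cut, w = -r, sqrt(w) = -i sqrt(r), r from 1 to infinity
    i_cd = -simpson(lambda r: g(-r, -1j * math.sqrt(r)), 1.0, r_max)
    return eta, i_ab + i_bc + i_cd

def B(z, T):
    eta, total = contour_sum(lambda w, sw, eta: eta * sw + ALPHA, z, T)
    return eta**2 / (2j * math.pi * ALPHA) * total

def B_z(z, T):
    eta, total = contour_sum(lambda w, sw, eta: eta**2 * w + ALPHA * eta * sw, z, T)
    return eta**2 / (2j * math.pi) * total

z0, T0, dz = -1.0, 1.0, 1e-3
finite_difference = (B(z0 + dz, T0) - B(z0 - dz, T0)) / (2 * dz)
print(B_z(z0, T0), finite_difference)
```

The two printed values should agree closely. Because the integration paths in the p plane move with z through $\eta$, this is a nontrivial consistency check on the signs and prefactors of the path integrals.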


4.6  Results 

4.6.1  Fluxes  and  Integrated  Fluxes 

The quantities $F_E(t;d)$, $F_S(t;d)$, $E(t;d)$, and $S(t;d)$ may now be calculated and results are
shown in Figure 1. From Figure 1(b), as noted in Section 4.3, by t = 1 nearly half of the total
horizontal near-inertial energy has left the mixed layer; only about 38% of the total energy
has penetrated below z = -1. By t = 2, 82% of the total energy has left the mixed layer,
but only 58% of the total energy has penetrated below z = -1. Thus, at t = 2 (using the
typical values quoted in Section 3.1, about 23 days after the storm) nearly a quarter of the
total energy is contained in the distance $H_{mix}$ immediately beneath the mixed layer. Figure
1(d) demonstrates that the shear tends to be localized just below the base of the mixed
layer. For example, by t = 5 the integrated shear flux which has passed through the base of
the mixed layer is, in nondimensional units, about 5.5. The integrated shear flux which has
penetrated below z = -0.05 is 4.6, and the integrated shear flux which has penetrated below
z = -1 is 1.35.
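The arithmetic behind these statements is simple enough to spell out. The sketch below uses only numbers quoted in the text (the time scale of about 11.5 days per nondimensional time unit and the percentages read from Figure 1(b)); the variable and function names are hypothetical.

```python
DAYS_PER_UNIT_TIME = 11.5   # t = 1 corresponds to about 11.5 days (Section 4.3)

def days_after_storm(t):
    """Convert nondimensional time to days using the quoted typical scale."""
    return DAYS_PER_UNIT_TIME * t

# Fractions of the initial mixed-layer energy quoted in the text for t = 2
left_mixed_layer = 0.82     # has passed below z = 0
below_unit_depth = 0.58     # has penetrated below z = -1

# Energy held in the layer -1 < z < 0 immediately beneath the mixed layer
in_top_layer = left_mixed_layer - below_unit_depth

print(days_after_storm(2), round(in_top_layer, 2))
```

The difference of 0.24 is the "nearly a quarter" of the total energy referred to above.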


4.6.2  Vertical  Profiles 

Using the expressions from Section 4.5 and equation (22) it is now also possible to calculate
vertical profiles of physically relevant quantities. Figure 4(a,b) shows the vertical dependence
of the HKE, $u^2 + v^2 = |A_{zz}|^2$, and the vertical shear, $u_z^2 + v_z^2 = |A_{zzz}|^2$, at
different times for y = 0. For both quantities, as time increases the instantaneous distribution
becomes more sharply peaked near the base of the mixed layer. The maximum value of
$u_z^2 + v_z^2$ increases without bound as time increases, but the maximum value of $u^2 + v^2$
remains bounded (asymptotically approaching unity) because of energy conservation. Figure 4(c,d)
shows the vertical dependence of the fluxes $F_E$ and $F_S$.

Figure 4: Vertical profiles of (a) $u^2 + v^2$ and (b) $u_z^2 + v_z^2$ at y = 0 for different times
showing the decay of energy from the mixed layer (0 < z < 1) and resultant behavior in the
interior (z < 0). Note that $u_z^2 + v_z^2 = 0$ in the mixed layer to leading order in $\epsilon$.
Also, vertical profiles of (c) $F_E(t,|z|)$, and (d) $F_S(t,|z|)$ for different times. Note the
different vertical scales.


4.6.3  Back-Rotated  Velocity  and  Shear 

Finally, consider the back-rotated velocity $A_{zz} = e^{if_0 t}(u + iv)$. This filters out the purely
inertial motion at frequency $f_0$. Similarly, the back-rotated shear is defined to be
$A_{zzz} = e^{if_0 t}(u_z + iv_z)$. The amplitudes of the back-rotated velocity and shear at different
depths are shown as time series in Figure 5. At a given depth, the magnitude of the back-rotated
velocity reaches a peak value shortly after the storm, then decays away, while the magnitude
of the back-rotated shear increases monotonically with time. Note that Figure 4 may also
be interpreted in terms of the back-rotated velocity and shear.

Back-rotated velocity and shear may be represented by hodographs which respectively
show the vectors $(\mathrm{Re}(A_{zz}), \mathrm{Im}(A_{zz}))$ and $(\mathrm{Re}(A_{zzz}), \mathrm{Im}(A_{zzz}))$
as curves parametrized by time. For $f_0 > 0$, if these curves are traced out in a clockwise
(counterclockwise) fashion, the corresponding motion has frequency larger (smaller) than $f_0$.
Figure 6 shows the back-rotated velocity and shear at y = 0, z = -1. The hodographs for both
quantities start at the origin and are traced out in a clockwise fashion. The back-rotated
velocity starts out in the third quadrant, reaches a peak in magnitude in the second quadrant,
then decays in magnitude, spiralling back to the origin. The back-rotated shear also starts out
in the third quadrant, and monotonically increases in magnitude while spiralling outward. Note
that the lines labelled z = -1 in Figure 5 give the radii of these hodographs as functions
of time. The depth dependence of the back-rotated velocity is seen by comparing Figure 6
with Figure 7(a), where both have y = 0 and thus the same value of the Coriolis parameter.
Qualitatively the results are the same, but closer to the mixed layer the direction change of
the back-rotated velocity becomes slower, meaning that the frequency is closer to $f_0$. An idea
of the latitudinal dependence is seen by comparing Figure 6 with Figure 7(b,c). At y = 1 the
hodograph is traced out in a clockwise fashion as for y = 0, but at y = -2 it is traced out in
a counterclockwise fashion.
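The rule relating the sense of rotation of a hodograph to the frequency of the motion can be illustrated with a single rotating current. The sketch below assumes a current of unit amplitude rotating clockwise at frequency omega, $u + iv = e^{-i\omega t}$, as for inertial motion with $f_0 > 0$; this monochromatic form and the function name are illustrative assumptions, not the solution computed in the text.

```python
import cmath

def hodograph_sense(omega, f0, t0=0.0, dt=1e-3):
    """Sense in which exp(i f0 t)(u + i v) is traced for u + i v = exp(-i omega t)."""
    back_rotated = lambda t: cmath.exp(1j * f0 * t) * cmath.exp(-1j * omega * t)
    # Phase increment over a short time step; negative means clockwise
    dphase = cmath.phase(back_rotated(t0 + dt) / back_rotated(t0))
    if dphase < 0:
        return "clockwise"
    if dphase > 0:
        return "counterclockwise"
    return "stationary"

print(hodograph_sense(1.1, 1.0))   # frequency slightly above f0
print(hodograph_sense(0.9, 1.0))   # frequency slightly below f0
```

A motion with frequency above $f_0$ leaves a residual clockwise rotation after back-rotation, and one below $f_0$ a counterclockwise rotation, which is the rule stated above.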


Figure 5: Time series for $|A_{zz}|$ and $|A_{zzz}|$ at different fixed z values.


4.7  Asymptotic  Behavior 

Making the change of variables (44), the inverse Laplace transforms of $b$, $b_z$, $b_{zz}$, and
$b_{zzz}$ can be written in the form

$$I(\xi) = \int_{\mathcal{B}} f(w,\eta)\,e^{\xi h(w)}\,dw, \qquad h(w) = -\frac{\alpha}{\sqrt{w}} + w.$$

The asymptotic behavior of this in the limit of large $\xi$ with $\eta$ fixed can be determined by the
method of steepest descents. This involves determining the saddle points of h(w) (i.e., points
satisfying h'(w) = 0) and deforming the contour $\mathcal{B}$ so as to pass through each saddle point
along a path of constant Im(h(w)) such that Re(h(w)) has a local maximum at each saddle
point. Supposing that there is a simple saddle ($h''(w_0) \neq 0$) at $w = w_0$,

$$I(\xi) \sim \frac{\sqrt{2\pi}\,f(w_0)\,e^{\xi h(w_0)}\,e^{i\gamma}}{|\xi h''(w_0)|^{1/2}}. \tag{47}$$

Here $\gamma$ is the angle relative to the positive real axis at which the path satisfying the above
conditions passes through the saddle. Letting $h''(w_0) = ae^{i\sigma}$, $\gamma = -\sigma/2 + \pi/2$ or
$\gamma = -\sigma/2 + 3\pi/2$; the appropriate choice is determined by making sure that Re(h(w)) only
has local maxima at the saddle points and nowhere else along the deformed contour.

Taking the branch cut for the square root along the negative real axis, for this problem
there are saddles at

$$w_1 = \frac{1}{2}e^{-i\pi/2}, \qquad w_2 = \frac{1}{2}e^{i5\pi/6},$$

with

$$h(w_1) = -\frac{3}{2}i, \qquad h(w_2) = -\frac{3\sqrt{3}}{4} + \frac{3}{4}i,$$

$$h''(w_1) = 3i, \qquad h''(w_2) = 3e^{-i5\pi/6}.$$

Figure 6: Back-rotated velocity and shear at z = -1, y = 0. In these and the time series
in Figure 7, the solid and dashed lines show real and imaginary parts, respectively. The
diamonds are drawn at t = 0, 5, 10, 15, 20.

Figure 8 shows the contours of constant real and imaginary parts of h(w). The deformed
contour is taken to pass through both of these saddles, and passes to the right of the origin
in order to avoid the branch cut. Since $h(w_1)$ is purely imaginary and $h(w_2)$ has negative
real part, it is immediately seen that the contribution from $w_2$ will be exponentially small
compared with the contribution from $w_1$ in the limit $\xi \to \infty$. Using $\gamma = \pi/4$ for the
passage through $w_1$, in the limit $\xi \to \infty$ with $\eta$ fixed

$$I(\xi) \sim \xi^{-1/2}\sqrt{\frac{2\pi}{3}}\,f(w_1,\eta)\,e^{i(-3\xi/2 + \pi/4)}. \tag{48}$$

Figure 7: Back-[...] -1, y = -2. The [...]

Figure 8: Contours of constant Re(h(w)) (solid lines) and Im(h(w)) (dashed lines) in the
complex w plane. The thick lines are the contours passing through the saddle points. The
branch cut is clearly seen along the negative real axis. In the method of steepest descents, the
Bromwich contour is deformed to pass through $w_1 = e^{-i\pi/2}/2 = -i/2$ with angle $\gamma = \pi/4$,
to the right of the origin avoiding the branch cut, and through $w_2 = e^{i5\pi/6}/2$ with angle

This implies that

$$B \sim \frac{\eta^2(1 + i\eta)}{\pi(1 + \eta^2)}\sqrt{\frac{2\pi}{3}}\,\xi^{-1/2}e^{i(-3\xi/2 + \pi/4)},$$

$$B_z \sim \frac{\eta(i - \eta)}{\pi(1 + \eta^2)}\sqrt{\frac{2\pi}{3}}\,\xi^{-1/2}e^{i(-3\xi/2 + \pi/4)},$$

$$B_{zz} \sim -\frac{1 + i\eta}{\pi(1 + \eta^2)}\sqrt{\frac{2\pi}{3}}\,\xi^{-1/2}e^{i(-3\xi/2 + \pi/4)},$$

$$B_{zzz} \sim \frac{\eta - i}{\eta\pi(1 + \eta^2)}\sqrt{\frac{2\pi}{3}}\,\xi^{-1/2}e^{i(-3\xi/2 + \pi/4)}.$$

Now, using $\xi = \eta^2 T = \eta^2 t^3/3$ and taking the constant $\eta$ to be $\eta_0$, along the "rays"
$z = -\eta_0^3 t^3/3$

$$u^2 + v^2 = |A_{zz}|^2 = |B_{zz}|^2 \sim \frac{2}{(1 + \eta_0^2)\pi\eta_0^2 t^3},$$

$$u_z^2 + v_z^2 = |A_{zzz}|^2 = |B_{zzz}|^2 \sim \frac{2}{\pi\eta_0^4(1 + \eta_0^2)t^3},$$

$$F_E = t^2\left[\mathrm{Im}(B_z)\mathrm{Re}(B) - \mathrm{Re}(B_z)\mathrm{Im}(B)\right]\Big|_{z=-\eta_0^3 t^3/3} \sim \frac{2\eta_0}{\pi(1 + \eta_0^2)t},$$

$$F_S = t^2\left[\mathrm{Im}(B_{zz})\mathrm{Re}(B_z) - \mathrm{Re}(B_{zz})\mathrm{Im}(B_z)\right]\Big|_{z=-\eta_0^3 t^3/3} \sim \frac{2}{\pi\eta_0(1 + \eta_0^2)t}.$$

These asymptotic relationships are confirmed in Figure 9. A more useful way to represent
the asymptotic results is to write $\eta_0$ in terms of z and t and then draw contour plots of
quantities of physical interest in the (z, t) plane; this is shown in Figure 10. Note that $\xi$ is
large for sufficiently large |z| and/or t. For example, this shows that in the asymptotic limit
for constant z, as time increases, $u^2 + v^2$ and $F_E$ decrease, while $u_z^2 + v_z^2$ and $F_S$ increase.
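The saddle-point quantities used in this section are easy to confirm numerically. The sketch below evaluates h, h', and h'' at the two saddles using the principal branch of the square root, which is what `cmath.sqrt` implements with its cut along the negative real axis. The function names are illustrative.

```python
import cmath
import math

ALPHA = cmath.exp(1j * math.pi / 4) / math.sqrt(2)

def h(w):
    """Exponent h(w) = -alpha/sqrt(w) + w on the principal branch."""
    return -ALPHA / cmath.sqrt(w) + w

def dh(w):
    """First derivative h'(w) = alpha/(2 w^{3/2}) + 1."""
    return ALPHA / (2 * w * cmath.sqrt(w)) + 1

def d2h(w):
    """Second derivative h''(w) = -3 alpha/(4 w^{5/2})."""
    return -3 * ALPHA / (4 * w**2 * cmath.sqrt(w))

w1 = 0.5 * cmath.exp(-1j * math.pi / 2)
w2 = 0.5 * cmath.exp(5j * math.pi / 6)

for name, w in (("w1", w1), ("w2", w2)):
    print(name, abs(dh(w)), h(w), d2h(w))

# Steepest-descent angle through w1 from h''(w1) = a exp(i sigma)
sigma = cmath.phase(d2h(w1))
print(-sigma / 2 + math.pi / 2)
```

The derivative h' vanishes at both points, and the last line reproduces the angle $\gamma = \pi/4$ used for the passage through $w_1$.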


5  Conclusion 

In  this  paper,  a  simplified  model  has  been  developed  and  studied  for  the  decay  of  near-inertial 
currents  excited  in  the  mixed  layer  by  a  passing  storm.  This  decay  occurs  due  to  the  radiation 
of  downward  propagating  NIOs  into  the  interior  of  the  ocean.  The  main  assumptions  of  the 
model  are  that  the  background  flow  does  not  vary  in  the  longitudinal  direction  and  has  no 
associated  vorticity,  the  ocean  has  a  simple  (piecewise  constant)  buoyancy  frequency  profile, 
and the storm has moved very quickly over the ocean causing a horizontally uniform
near-inertial current concentrated in the mixed layer. The β effect is included in the analysis.
Because the depth of the mixed layer is much smaller than the total depth of the ocean, the
problem is formulated in the limit of an effectively infinitely deep ocean; the resultant initial
value problem is solved by Laplace transforms. Analytical and numerical results are given for
quantities of physical interest including horizontal kinetic energy, vertical shear, energy and
shear flux, and back-rotated velocity and shear. Also, asymptotic behavior is determined by
the method of steepest descents.

Figure 9: Comparison of numerically calculated (solid lines) and asymptotic (dashed lines)
results along the ray $z = -t^3/3$, i.e., $\eta_0 = 1$. (a) $u^2 + v^2$, (b) $u_z^2 + v_z^2$,
(c) $F_E$, (d) $F_S$.


Although this simplified model cannot be expected to capture the full complexity of the
aftermath of a storm passing over the ocean, it does capture much of the observed behavior. Most
importantly, the decay of mixed layer energy is found to occur on the appropriate timescale
(approximately twenty days). It would be interesting to compare the results obtained for this
simplified model with observations and numerical simulations. Also, from both a computational
and a more philosophical perspective, it would be interesting to compare this method
of solution with the standard approach of projecting onto normal modes (e.g., [2, 3]). In
the latter, the decay must be viewed as a complicated interference between normal modes,
while in the method presented in this paper it is more naturally viewed as a radiation problem.
Extensions to a more realistic ocean and storm would involve including a more realistic
buoyancy frequency profile (for example, the profile used by [10]), considering the effect of
different initial velocities (including both horizontal and vertical structure), and considering
the effect of background flow. The study of all of these could use the same formalism of [18]
and an approach similar to that presented here.

Stirring and Mixing by Vortical Modes


Jennifer  MacKinnon 


Abstract 

Tracer release experiments in coastal and open ocean settings have revealed unexpectedly
large isopycnal diffusivities. The stirring and mixing effects of small scale eddies
(vortical modes) are discussed as a possible candidate for enhancing diffusivity. A simple
analytical model of vortical modes is used to evaluate their mixing potential.


1  Introduction 

Recent  oceanic  dye  release  experiments  have  provided  a  venue  for  evaluating  existing  models 
of  passive  tracer  horizontal  dispersion.  In  the  Coastal  Mixing  and  Optics  (CMO)  experiment 
and  the  North  Atlantic  Tracer  Release  Experiment  (NATRE),  anthropogenic  dye  was  injected 
along  constant  density  surfaces  (isopycnals)  in  streaks  a  few  km  long.  Subsequent  surveys 
through  the  evolving  dye  patches  with  towed  instruments  were  used  to  look  at  the  qualitative 
nature  of  dye  patch  evolution  and  calculate  quantitative  measures  of  isopycnal  diffusion  [1,2]. 

In  the  CMO  experiment,  several  mid-water-column  dye  releases  were  conducted  between 
1995  and  1997  in  ~  80  m  deep  water  on  the  continental  shelf  south  of  New  England.  Post 
release  surveys  were  conducted  over  the  several  days  following  each  release.  By  assuming 
that  dye  streak  evolution  was  governed  by  a  balance  between  horizontal  diffusion  and  strain 
induced stretching [3], Sundermeyer 98 calculates an observed horizontal diffusivity of 0.3-5
m²/s [1]. A traditional way of estimating horizontal diffusivity in the ocean is by looking
at the enhanced diffusivity that comes from combining vertical diffusion with shear from the
internal wave field [4]. Applying this method to the measured CMO velocities gives diffusion
estimates that are a factor of 1-10 below observed values [1].

The  NATRE  experiment  was  conducted  in  open  ocean  1200  km  west  of  the  Canary  Islands 
during  May  1992  [2].  The  dye  was  sampled  five  times  during  several  subsequent  years.  Over 
this  extended  period,  the  qualitative  evolution  of  the  dye  agrees  reasonably  well  with  the 
model  of  horizontal  diffusion  presented  by  Garrett  83  [1].  He  proposes  that  a  dye  patch  will 
initially  be  stretched  into  long  twisted  streaks,  the  width  of  which  is  governed  by  a  balance 
between  an  effective  isopycnal  diffusivity  and  exponential  stretching.  The  observed  dye  at 
6 months is indeed streaky in nature and calculations give an estimated effective diffusivity
of 3 m²/s. The Young, Rhines and Garrett shear dispersion model applied to this situation
gives an estimate of 0.08 m²/s [1]. For longer times, Garrett predicts that dye streaks will
coalesce into a more homogeneous patch that grows with an effective diffusivity related to
the Lagrangian velocity autocorrelation time scale. During later dye surveys (1-2 years), the
tracer has expanded to encompass hundreds of kilometers and is more homogeneous. The
observed diffusivity for this scale is 10³ m²/s, and agrees with estimates calculated from float
velocities.

In both experiments, observed diffusivities on 1-10 km scales are larger than predicted by
shear dispersion alone. Both Sundermeyer and Polzin et al. 99 propose that the small scale
eddies known as vortical modes could be an alternate diffusion mechanism. In brief, vortical
modes  are  thought  to  be  generated  when  a  vertical  mixing  event  creates  a  relatively  well 
mixed  patch  of  water  within  a  stratified  fluid  [5].  This  density  anomaly  adjusts  to  pressure 
and  Coriolis  forces  by  spreading  radially  outward  and  beginning  to  rotate  anticyclonically.  A 
steady  rotational  state  can  exist  until  the  anomaly  diffuses  away  or  some  event  or  instability 
breaks  it  apart.  Both  the  adjustment  and  equilibrium  phases  can  act  to  enhance  horizontal 
diffusivity. 

Sundermeyer  considers  dimensional  arguments  and  concludes  that  in  the  coastal  ocean 
at  least,  horizontal  spreading  from  vortical  mode  generation  and  adjustment  may  be  of  an 
appropriate  magnitude  to  explain  observed  diffusivities  [1].  Polzin  et  al.  evaluate  evidence 
of  vortical  modes  in  NATRE  by  looking  at  spectral  shear  and  strain  values  that  are  not  well 
explained  by  an  internal  wave  field.  Using  inferred  vortical  mode  spectra,  they  calculate 
estimates  for  shear  dispersion  and  stirring  contributions  to  effective  diffusivity  and  also  get 
results  potentially  large  enough  to  explain  observations^]. 

To  tackle  the  time  dependent  picture  of  diffusivity  due  to  vortical  modes,  we  start  with 
a  broad  qualitative  description  of  what  might  happen  to  an  initially  small  dye  patch  in  an 
ocean  where  small  eddies  randomly  appear  and  disappear  in  various  locations  near  the  patch. 
We  ignore  the  initial  vertical  mixing  event  that  generates  the  vortical  mode  as  beyond  the 
scope  of  this  project.  Instead  we  assume  that  we  start  with  a  round  (axisymmetric)  density 
anomaly  that  abruptly  appears  near  a  small  patch  of  concentrate.  Over  the  lifetime  of  a 
single  vortex,  a  small  dye  patch  located  nearby  feels  a  net  (center  of  mass)  displacement 
both  outward  due  to  the  initial  adjustment  and  around  the  vortex.  The  patch  also  feels 
a  distortion  due  to  the  stretching  effects  of  radial  shear  and  molecular  and  shear-enhanced 
diffusion.  On  a  longer  timescale,  the  patch  will  witness  the  appearance  and  disappearance 
of  many  vortices  appearing  at  different  positions  around  it.  Additionally,  as  it  grows  in  size 
it  eventually  becomes  large  enough  to  feel  several  spatially  separated  vortices  at  the  same 
time.  The  larger  the  number  of  vortices  felt,  the  more  their  net  displacements  will  cancel  out 
to  produce  little  net  patch  movement.  However,  the  net  movements  felt  by  smaller  segments 
of  the  patch  stretch,  twist,  and  fold  the  patch;  increase  gradients;  and  enhance  diffusion 
to  cause  the  patch  as  a  whole  to  grow  in  size.  We  seek  an  understanding  of  diffusion  that 
will  encompass  both  small  and  large  time  limits.  For  small  times,  we’ll  have  a  more  time 
dependent  story  of  patch  growth.  For  larger  times  and  larger  spatial  scales  we  hope  that 
the  net  effects  of  smaller  motions  can  be  parameterized  by  an  effective  diffusivity.  Our  goals 
hence  are  threefold:  to  understand  the  time  dependence  of  small  time  evolution,  to  estimate  a 
long  time  eddy  diffusivity,  and  to  figure  out  an  appropriate  time  scale  for  transition  between 
these  two  regimes. 

Toward these goals, we develop a simple analytical model that we hope replicates some
of the essential stirring and mixing characteristics of oceanic vortical modes. Such an
analytical approximation allows us to look at the role small eddies may play in the different
stages of horizontal tracer dispersion, get some bounds on the relative importance of different
diffusive mechanisms, and explore how these results are functions of fundamental problem
parameters. In Section 2 we begin by looking at numerically integrated full solutions
for vortical mode generation. Based upon these solutions, in Section 3 we suggest
an analytical approximation for a vortical mode field and extend it into a time dependent
stochastic model. In Section 4 we evaluate the expected effect of an idealized single vortex
velocity field on the evolution of a dye patch. We expand the time scale to consider a time
dependent blinking vortex field in Section 5. Throughout, we try to integrate different ways of
looking at diffusion and consider dependence on a few oceanographic parameters of interest.
Finally, in Section 6 we re-dimensionalize our results and make some simple comparisons to
the observed ocean values.

2  Vortical  Mode  Solutions 

Before  we  can  develop  an  appropriate  analytical  model,  we  must  first  get  a  better  feeling 
for  physically  realistic  vortex  velocity  fields.  We  start  with  the  generation  of  the  density 
anomalies that become vortical modes. We then numerically solve for the equilibrium solutions
that describe the velocity field after an adjustment period. Finally, we consider physical
arguments  for  time  dependence. 

2.1  Generation  Possibilities 

Many observations have shown diapycnal mixing in the ocean to occur episodically [7]. Energetic
events such as internal wave breaking and wave-wave interaction can lead to shear
instabilities and overturning. Such mixing events produce local regions of relatively unstratified
water. Most observations of mixed patch height in the CMO coastal area range from
2-10  m  [1].  In  the  NATRE  experiment,  observed  vertical  patch  sizes  were  much  larger,  on 
the  order  of  10-50  m  [6].  It  is  difficult  to  measure  the  horizontal  extent  of  mixing  events.  A 
starting  guess  is  that  the  mixed  patches  have  roughly  the  same  aspect  ratio  as  the  internal 
waves  that  generate  them,  N/f  (where  N  is  the  buoyancy  frequency  and  f  the  local  inertial 
frequency).  Microstructure  sections  made  during  the  CMO  experiment  show  mixing  patches 
on  the  order  of  a  kilometer  in  horizontal  extent  [8]. 

Following Garrett and Munk, we estimate the frequency of vertical mixing events by
considering net observed vertical diffusivity [1]. We assume that vertical diffusion of tracer
is due to the sum of discrete identical vertical mixing events which occur with a frequency
$\nu$, are characterized by height $h_*$, and by stratification change $\Delta N^2$. Using potential energy
arguments we can infer the frequency of event occurrence given an observed vertical diffusivity
of $K_z$,

$$\nu = 6\,\frac{N^2}{\Delta N^2}\,\frac{1}{h_*^2}\,K_z. \qquad (1)$$



2.2  Adjustment 

A density anomaly of sufficient size in a stratified fluid will evolve according to the pressure
and Coriolis forces it feels until a state of balance is achieved. In studying the equilibrium
solution reached after adjustment of an initial density anomaly, we follow McWilliams' tactic
of combining the thermal wind equations with conservation of potential vorticity and mass
[9]. He starts by non-dimensionalizing as follows:

$$\begin{aligned}
q_d &= f N_*^2\, q \\
\theta_d &= \frac{\rho_* N_*^2 h_*}{g}\,\theta \\
r_d &= l_*\, r \\
z_d &= h_*\, z \\
p_d &= \rho_* v_* f l_*\, p \\
\rho_d &= \frac{\rho_* N_*^2 h_*}{g}\,\rho \\
v_d &= v_*\, v,
\end{aligned}$$

where the subscript 'd' indicates the dimensional form of a variable. Variables without this
subscript (now and throughout this paper) are assumed to be dimensionless. Variable $q$ is
Ertel's potential vorticity, $\theta$ is the density anomaly (deviation from constant stratification)
and the other variables have their traditional meaning (radial and vertical distance, pressure,
density and velocity respectively). Important non-dimensional parameters are given by


R 

B
4v*

lu

,i\u*  2

(  fU  j
4:B'

$R$ and $B$ are the Rossby and Burger numbers. $\beta$ can be shown to be a measure of the strength
of the density anomaly, with initial density and stratification profiles given non-dimensionally
by

$$\rho_i = -z + \beta\theta, \qquad N_i^2 = 1 - \beta\,\frac{\partial \theta}{\partial z}.$$

(We use $\beta$ instead of McWilliams' $\gamma$ because the latter is used as a stretching rate in later
sections.) Following McWilliams, we assume an initial density anomaly of the form

$$\theta = 2z\,e^{-\left[z^2 + (r/r_0)^2\right]}. \qquad (2)$$

At this point, parameters $r_0$ and $B$ both describe an aspect ratio of the problem. Without
loss of generality, we can set $B = 1$ and vary $r_0$.

For simplicity, we adopt these non-dimensionalizations for the remainder of our paper.
Physically, all horizontal distances, angular velocities and times discussed are fractions of
the internal Rossby radius, inertial frequency and inertial period respectively. To convert
any result back into dimensional coordinates, one only needs to specify $f$, $N$ and an anomaly
height scale $h_*$. Dimensional quantities are then given by

$$r_d = \frac{N h_*}{f}\, r \qquad (3)$$

$$\Omega_d = f\,\Omega \qquad (4)$$

$$t_d = \frac{1}{f}\, t. \qquad (5)$$

We'll return briefly to the world of dimensions at the end of the paper to compare our results
with oceanographic data.
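
As a rough illustration of this conversion, the sketch below scales a non-dimensional radius, angular velocity and time back to dimensional units. The class and method names are hypothetical, the sample values of $f$, $N$ and $h_*$ are illustrative assumptions rather than values used in this study, and the time unit is taken to be $1/f$, which is also an assumption.

```python
from dataclasses import dataclass


@dataclass
class Scales:
    """Dimensional scales; all names here are illustrative, not from the text."""
    f: float  # inertial frequency (rad/s)
    N: float  # buoyancy frequency (rad/s)
    h: float  # anomaly height scale h_* (m)

    def radius(self, r: float) -> float:
        # Horizontal distances are fractions of the internal Rossby radius N h_* / f.
        return self.N * self.h / self.f * r

    def angular_velocity(self, omega: float) -> float:
        # Angular velocities are fractions of the inertial frequency f.
        return self.f * omega

    def time(self, t: float) -> float:
        # Assumption: the time unit is 1/f rather than 2*pi/f.
        return t / self.f


# Assumed mid-latitude values, chosen only for illustration.
scales = Scales(f=1.0e-4, N=1.0e-2, h=10.0)
print(scales.radius(1.0))             # one Rossby radius, in metres
print(scales.angular_velocity(0.07))  # rad/s
print(scales.time(1.0) / 86400.0)     # days
```

With these assumed values one non-dimensional length unit is about a kilometer, which is the same order as the horizontal patch extent quoted in Section 2.1.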

The actual process of geostrophic adjustment will involve internal waves which radiate
energy away in a time roughly given by $1/f$ [10]. After the transients have disappeared, the
system will achieve an equilibrium state where Coriolis, pressure and centrifugal forces are
in balance. Equilibrium density and potential vorticity are given by
R  dp
Z  =  1  +


R  d(rv) 
4  r  dr 


Knowledge  of  one  variable,  p(r,z ),  is  enough  to  specify  the  whole  system.  This  final  state 
is  related  to  the  initial  state  through  the  net  displacements  felt  by  each  Lagrangian  water 
parcel.  Non-dimensional  displacement  variables  are  defined  as 


$$\zeta(r_f, z_f) = r_f - r_i \qquad (6)$$

$$\eta(r_f, z_f) = z_f - z_i. \qquad (7)$$

These parcels conserve their density and potential vorticity values during adjustment

$$\rho_f(r, z) = \rho_i(r - \zeta,\; z - \eta) \qquad (8)$$

$$q_f(r, z) = q_i(r - \zeta,\; z - \eta). \qquad (9)$$

Figure 1: Pre-adjustment density field used for numerical calculations. The full density field
is  shown  in  (a),  the  density  anomaly  in  (b),  and  a  vertical  profile  at  r  =  0  in  (c).  Also  shown 
in  (c)  is  a  reference  linear  stratification  profile.  All  values  are  non-dimensional. 

Plugging forms for $\rho_i, \rho_f, q_i, q_f$ into equations (8) and (9) gives us two equations and three
unknowns $(\rho, \zeta, \eta)$. The final equation comes from requiring that the adjustment be incompressible,
or equivalently that the Jacobian of the Lagrangian transformation is identically
one:

$$\frac{r_i}{r_f}\left(\frac{\partial r_i}{\partial r_f}\frac{\partial z_i}{\partial z_f} - \frac{\partial r_i}{\partial z_f}\frac{\partial z_i}{\partial r_f}\right) = 1, \qquad (10)$$

where $r_i, r_f, z_i, z_f$ can be written in terms of $\zeta$ and $\eta$ following equations (6)-(7). As McWilliams
suggests, an iterative method is necessary to solve the non-linear set of equations (8)-(10) for
$\rho, \zeta, \eta$. There are only two free parameters in this system, $r_0$ and $\beta$, which control the aspect
ratio and strength of the initial anomaly, respectively. Physically, $r_0 = 1$ corresponds to an
anomaly with aspect ratio $f/N$. Larger $r_0$ corresponds to a flatter anomaly and vice versa.
The anomaly strength is controlled by $\beta$, which ranges from 0 (no anomaly, or background
stratification) to .5 (minimum of zero stratification). Figure 1 shows an initial density field
with $r_0 = 1$, $\beta = \frac{1}{4}$, the associated density anomaly, and a profile of density at $r = 0$ (with
linear stratification for comparison). These values of $r_0$ and $\beta$ will be used as a good first
estimate.
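
The stated range of the anomaly strength can be checked directly from the initial profiles. The sketch below evaluates the initial anomaly as read from equation (2), $2z\,e^{-[z^2+(r/r_0)^2]}$, and the stratification obtained by differentiating the initial density in $z$. The function names are hypothetical, and the closed-form derivative is our own working rather than a formula quoted from McWilliams.

```python
import math


def theta(r, z, r0=1.0):
    """Initial density anomaly, as read from eq. (2)."""
    return 2.0 * z * math.exp(-(z * z + (r / r0) ** 2))


def dtheta_dz(r, z, r0=1.0):
    """Analytic z-derivative of theta (our own differentiation)."""
    return 2.0 * (1.0 - 2.0 * z * z) * math.exp(-(z * z + (r / r0) ** 2))


def initial_density(r, z, beta, r0=1.0):
    """Non-dimensional initial density: linear stratification plus anomaly."""
    return -z + beta * theta(r, z, r0)


def initial_stratification(r, z, beta, r0=1.0):
    """Non-dimensional N^2 = -d(rho_i)/dz = 1 - beta * d(theta)/dz."""
    return 1.0 - beta * dtheta_dz(r, z, r0)


# Stratification at the anomaly center for a few strengths.
for beta in (0.0, 0.25, 0.5):
    print(beta, initial_stratification(0.0, 0.0, beta))
```

The derivative of the anomaly is largest at the center, where it equals 2, so the center stratification is $1 - 2\beta$. It vanishes at $\beta = .5$ and is half the background value for $\beta = .25$.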

We  numerically  integrated  solutions  to  equations  (8),  (9),  and  (10)  1.  Boundary  conditions 

1 Numerical integration was done using routines AVINT and HSTCYL, available from the NIST Guide to
Available Math Software.

Figure 2: Equilibrium solutions for the density anomaly (a) and angular velocity (b).

As expected, equilibrium solutions consist of a slumped density anomaly rotating anticyclonically.
Figure 2 shows non-dimensional equilibrium density and angular velocity fields.
Above  and  below  the  main  anomaly,  there  are  smaller,  cyclonically  rotating  vortices  that 
correspond  to  regions  of  enhanced  stratification  that  border  the  well  mixed  patch.  We  ignore 
these  smaller  vortices  for  now  and  hope  to  come  back  to  them  in  future  work. 

Figure 3 shows profiles at $z = 0$ of radial adjustment displacements, $\zeta$, and equilibrium
angular velocities. Adjustment displacements are largest at the edge of the initial anomaly
($r = r_0 = 1$), and have a maximum value that approaches 1 as $r_0$ becomes large. Angular
velocity is roughly given by solid body rotation out to $r \sim .5$, and exponentially decays for
larger $r$. Also shown is an analytical approximation of velocity that will be used in later
sections. Further dependence on parameters $r_0$ and $\beta$ is considered by McWilliams. The
effect  of  varying  these  parameters  on  our  diffusion  calculations  may  be  discussed  in  future 
work. 


2.3  Time  dependence 

An isolated vortical mode will exist in its equilibrium state until the density anomaly diffuses
away, another vertical mixing event occurs on top of it, or it succumbs to some type of
instability or large scale strain. The time scale for vertically mixing an anomaly of height $h$
away by a molecular vertical diffusivity $K_{zm}$ is

$$\tau_1 = \frac{h^2}{K_{zm}}. \qquad (15)$$

Figure 3: Outward adjustment displacements (a) and equilibrium radial velocity (b) at $z = 0$
for $r_0 = 1$ and $\beta = .25$. Also shown are distances $r_1$ and $r_2$ which are the edge of solid body
rotation and an effective edge of vortex influence.

A new vortex will on average appear at a particular site with a frequency given by (1). Since
molecular diffusivity will always be smaller than observed diffusivity, we expect the maximum
lifetime for vortices will be given by

$$\tau = 1/\nu.$$

For the coastal ocean observed patch size of 2-10 m and observed vertical diffusivity of
$10^{-5}$ m$^2$/s, the appropriate dimensional timescale is $\tau$ = 1-20 days. In non-dimensional
units, $\Omega\tau$ = .01-.3. In most cases, significantly less than one rotation is completed. In the
open ocean, patch heights of 10-50 m and diffusivities of .5-1 $\times 10^{-5}$ m$^2$/s give non-dimensional
$\Omega\tau$ = .3-15. (All calculations assume $\Omega_0$ = .07.) In the latter case vortices can exist in
equilibrium form for many rotations. This difference in the time scale of vortical modes in
different regions is in itself a major result. We expect that the ways in which vortical modes
contribute to diffusion may be significantly different in these two situations. Hence, it will be
a primary goal of subsequent calculations to consider how vortical mode enhanced diffusion
both qualitatively and quantitatively depends on our choice of $\tau$.
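
The dimensional lifetimes quoted above can be reproduced approximately with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, the coefficient of 6 is how we read equation (1) and should be treated as an assumption, and the stratification ratio $N^2/\Delta N^2$ is set to 1 for simplicity.

```python
SECONDS_PER_DAY = 86400.0


def molecular_decay_time(h, kz_molecular):
    """Eq. (15): time for a molecular diffusivity to mix away an anomaly of height h."""
    return h * h / kz_molecular


def event_frequency(kz, h, n2_over_dn2=1.0, coeff=6.0):
    """Mixing-event frequency in the form of eq. (1).

    coeff=6 and n2_over_dn2=1 are assumptions made for this sketch.
    """
    return coeff * n2_over_dn2 * kz / (h * h)


def vortex_lifetime_days(kz, h, n2_over_dn2=1.0):
    """Maximum vortex lifetime, the inverse of the event frequency, in days."""
    return 1.0 / event_frequency(kz, h, n2_over_dn2) / SECONDS_PER_DAY


# Coastal case: patch heights of 2 m and 10 m, vertical diffusivity 1e-5 m^2/s.
for h in (2.0, 10.0):
    print(h, round(vortex_lifetime_days(1.0e-5, h), 2))
```

Under these assumptions the two coastal patch heights give lifetimes of a little under one day and a little under twenty days, in line with the range stated in the text.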


3  Analytical  Model 

We now wish to consider the diffusive effects of a field of such vortices. To make our task
tractable, we incorporate an analytical approximation to the solutions in Section 2 into a
simple Random Renovating Vortex model that allows us to stochastically approach diffusion
quantities of interest. We posit a field of vortices, each of which appears in a random location,
exists for a set time $\tau$, identical for all vortices, and disappears. Associated with the
appearance of any vortex are instantaneous adjustment displacements $\zeta$ outward from the
vortex center. During the subsequent interval $\tau$ of vortex existence, water parcels follow a
steady azimuthal velocity field, $\Omega(r)$. For simplicity, we basically ignore $z$ dependence from
now on (except when considering vertical shear dispersion). We hope a first order picture
of the dispersive abilities of vortical modes will emerge from considering the radial dependence
alone. Future work may include a more baroclinic model. The numerical solutions to
McWilliams' equations suggest analytical forms as follows. Let the azimuthal velocity field a
distance $r$ from the center of the vortex be given by

$$\Omega(r) = \begin{cases} \Omega_0 & r < r_1 \\ \Omega_0\, e^{-\alpha (r - r_1)} & r > r_1. \end{cases} \qquad (16)$$

The free parameters are $r_1$, the edge of the solid body rotation part of the vortex, $\Omega_0$, the
velocity scale, and $\alpha$, the exponential decay rate for the outer vortex velocity field. Fitting
this model to the numerical solution shown in Figure 3 yields $r_1 = .5$, $\alpha = 1.3$, and $\Omega_0 = .07$.
Also noted on the graph is a distance $r_2$, which we define as a length-scale of vortex influence.
It should scale roughly with the exponential decay scale, $r_2 \sim 1 + 1/\alpha$. For simplicity, we set

$$r_2 = 2.$$
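
For reference, the fitted profile is simple to evaluate. The sketch below is a minimal implementation of the piecewise form (16) with the fitted values as defaults; the function names are hypothetical.

```python
import math


def omega(r, omega0=0.07, r1=0.5, alpha=1.3):
    """Angular velocity of eq. (16): solid body inside r1, exponential decay outside."""
    if r < r1:
        return omega0
    return omega0 * math.exp(-alpha * (r - r1))


def rotation_angle(r, tau, omega0=0.07, r1=0.5, alpha=1.3):
    """Angle (radians) swept by a parcel at radius r during a vortex lifetime tau."""
    return omega(r, omega0, r1, alpha) * tau


# Angular velocity relative to the core value at a few radii.
for r in (0.0, 0.5, 1.0, 2.0):
    print(r, omega(r) / 0.07)
```

The profile is continuous at $r = r_1$, and at the chosen edge of influence $r_2 = 2$ the angular velocity has dropped to roughly 14% of its core value.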


Simplifying further, we assume a parcel at any given location feels only one vortex at
a time and that vortices do not interact with each other. Despite these caveats, we want
vortices to be evenly distributed in space, in some statistical sense. Hence, we propose that
the probability of a vortex appearing within $r$ and $r + dr$ of a given parcel is given by the
Holtsmark distribution,

$$P(r)\,dr = \frac{2r}{r_2^2}\, e^{-(r/r_2)^2}\, dr.$$

Intuitively, this probability is proportional to the area of the strip between $r$ and $r + dr$,
multiplied by the probability that there isn't a closer (within a circle $r$) vortex. The probability
is normalized using the vortex spatial scale, $r_2$. Finally, we assume that these spatially
uncorrelated vortices blink in and out of existence simultaneously over units of time $\tau$. At
small scales, the steady flow field of the single, closest vortex will be felt during each time
interval. For objects of larger scales, several spatially separated vortices may be felt during
each time step.
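
A quick numerical check of this distribution is sketched below. It assumes the density has the form $(2r/r_2^2)\,e^{-(r/r_2)^2}$, which is how we read the expression above; the function names, the Simpson's rule helper and the inverse-CDF sampler are our own illustrative choices.

```python
import math
import random


def nearest_vortex_pdf(r, r2=2.0):
    """Probability density of the distance r to the closest vortex center."""
    return 2.0 * r / r2 ** 2 * math.exp(-(r / r2) ** 2)


def sample_nearest_vortex_distance(rng, r2=2.0):
    """Draw a distance by inverting the CDF F(r) = 1 - exp(-(r/r2)^2)."""
    return r2 * math.sqrt(-math.log(1.0 - rng.random()))


def simpson(func, a, b, n=2000):
    """Composite Simpson's rule on [a, b] with an even number of intervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = func(a) + func(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * func(a + k * h)
    return total * h / 3.0


# Total probability, and probability that the closest vortex is farther than r1 = .5.
print(simpson(nearest_vortex_pdf, 0.0, 20.0))
print(simpson(nearest_vortex_pdf, 0.5, 20.0), math.exp(-(0.5 / 2.0) ** 2))
```

With $r_2 = 2$ the density integrates to one, the probability of sitting outside the solid body core is $e^{-(r_1/r_2)^2}$, and the mean squared distance to the closest vortex is $r_2^2$.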

To account for the variability associated with different realizations of vortex placement,
we ensemble average many quantities of interest. For any quantity which is a function of
distance to the vortex center, $r$, we define

$$\langle f(r) \rangle = \int_0^\infty f(r)\,P(r)\,dr.$$

Ensemble averaging can be problematic, as no particular oceanic realization will resemble
the smoothness of an ensemble average. However, such averaging is necessary to make our
problem tractable and we hope that the most salient features are preserved.

4  Short  Times  /  One  Vortex

Using  this  model  we  can  consider  what  happens  to  a  small  dye  patch  during  short  times, 
in  which  it  feels  only  the  effects  of  a  single  vortex.  The  center  of  mass  of  the  patch  will 
be  displaced  as  a  point  parcel  would  be.  This  displacement  is  made  up  of  two  components, 
the  quick  adjustment  radial  displacement  and  the  slow  azimuthal  displacement  from  the 
equilibrium velocity field. A patch will also be stretched by radial shear and, for long times
and high shear, could be wrapped up around the vortex center several times. Finally, the patch
will  be  diffusing  all  this  time,  both  from  molecular  diffusion  and  under  some  circumstances 
from  an  enhanced  horizontal  shear  dispersion.  Each  of  these  items  will  be  considered  in  turn 
below. 


4.1  Outward  Displacements 

The first displacement comes during the adjustment phase and is given by $\zeta(r)$. The ensemble
average adjustment displacement felt by a parcel will take into account the probability of
being a given distance away from the closest vortex center and is given by

$$\langle \zeta(r)^2 \rangle = \int \zeta^2(r)\,P(r)\,dr \qquad (17)$$

$$\approx .01, \qquad (18)$$

where the approximate form was obtained by integrating the numerical $\zeta(r)$ profile shown
in Figure 3. As a reminder, these numbers should be scaled by $(h_* N/f)^2$ to return to
dimensional units of m$^2$. Since we don't have a good analytical approximation for $\zeta(r)$ and
since it doesn't vary with the main parameter of interest, $\tau$, we aim for just an order of
magnitude estimate.

4.2  Azimuthal  Displacements 

The average azimuthal displacements felt by a parcel moving around a vortex will depend
not only on the distance to the vortex center, but also on the length of time $\tau$ it has to move.
For a given $r$ and $\tau$, the displacement is shown in Figure 4 and is given by

$$l^2 = 2r^2\left[1 - \cos(\Omega(r)\tau)\right]$$

$$\langle l^2 \rangle = \int P(r)\, 2r^2\left[1 - \cos(\Omega(r)\tau)\right] dr. \qquad (19)$$

With $\Omega(r)$ given by (16), it is difficult to solve for $\langle l^2 \rangle$ analytically. However, there are a
few limits of interest that are more approachable. For small vortex lifetimes, we can Taylor
expand (19) in powers of $(\Omega\tau)$,

$$\langle l^2 \rangle = \int P(r)\, 2r^2\left[\frac{(\Omega\tau)^2}{2} - \frac{(\Omega\tau)^4}{24} + \cdots\right] dr. \qquad (20)$$

Figure 4: Azimuthal distance traveled during a time $\tau$ by a particle a distance $r$ away from
the center of the closest vortex (V).

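Taking only the first expansion term and integrating gives

$$\langle l^2 \rangle \approx 2(\Omega_0\tau)^2\, r_2^2\, e^{-(r_1/r_2)^2}\,\Gamma, \qquad (21)$$

where we define

$$\Gamma = \frac{1}{4}\Big[\sqrt{\pi}\,\alpha r_2\, e^{\alpha^2 r_2^2 + 2\alpha r_1 + (r_1/r_2)^2}\left(3 + 2(\alpha r_2)^2\right)\left(\operatorname{erf}\!\left(\frac{r_1}{r_2} + \alpha r_2\right) - 1\right) + 2\left(\left(\frac{r_1}{r_2}\right)^2 - \alpha r_1 + (\alpha r_2)^2 + 1\right)\Big]. \qquad (22)$$

The upper bound on (19) is obtained by noting that

$$1 - \cos[\Omega(r)\tau] < 2,$$

and thus

$$\langle l^2 \rangle < 4\,(r_2^2 + r_1^2)\, e^{-(r_1/r_2)^2}. \qquad (23)$$

A numerical solution to equation (19) together with the limits given in (21) and (23) is plotted
in Figure 5a. The outward displacement magnitude, $\langle \zeta^2 \rangle$, is also plotted for reference.
Figure 5b shows expected squared displacements divided by $\tau$, which is related to the diffusivity
expected for a random walk process of given step length. For increasing $\tau$, average
displacements approach steady values, but expected diffusivities peak and then decline.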
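
The closed form (22) can be checked against direct quadrature. The sketch below assumes the nearest-vortex density $(2r/r_2^2)e^{-(r/r_2)^2}$ and the exponential outer profile of (16), and compares the outer ($r > r_1$) part of the small-time integral, $\int_{r_1}^{\infty} P(r)\, r^2 e^{-2\alpha(r - r_1)}\,dr$, with $2 r_2^2 e^{-(r_1/r_2)^2}\,\Gamma$. The function names and the Simpson's rule helper are our own, and the identity being tested is our reading of (21)-(22) rather than a statement quoted from elsewhere.

```python
import math


def simpson(func, a, b, n=6000):
    """Composite Simpson's rule on [a, b] with an even number of intervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = func(a) + func(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * func(a + k * h)
    return total * h / 3.0


def gamma_closed_form(r1=0.5, r2=2.0, alpha=1.3):
    """Closed form for Gamma, following the structure of eq. (22)."""
    rho = r1 / r2
    c = alpha * r2
    u1 = rho + c
    erf_minus_one = -math.erfc(u1)  # erf(u1) - 1, avoiding cancellation
    first = (math.sqrt(math.pi) * c * math.exp(u1 * u1)
             * (3.0 + 2.0 * c * c) * erf_minus_one)
    second = 2.0 * (rho * rho - alpha * r1 + c * c + 1.0)
    return 0.25 * (first + second)


def outer_integral(r1=0.5, r2=2.0, alpha=1.3, upper=15.0):
    """Quadrature of P(r) r^2 (Omega/Omega0)^2 over the exponential part of the vortex."""
    def integrand(r):
        pdf = 2.0 * r / r2 ** 2 * math.exp(-(r / r2) ** 2)
        return pdf * r * r * math.exp(-2.0 * alpha * (r - r1))
    return simpson(integrand, r1, upper)


closed = 2.0 * 2.0 ** 2 * math.exp(-(0.5 / 2.0) ** 2) * gamma_closed_form()
print(closed, outer_integral())
```

Multiplying either number by $(\Omega_0\tau)^2$ gives the small-time estimate of the mean squared azimuthal displacement from the exponential part of the vortex. The two terms inside $\Gamma$ nearly cancel for the fitted parameters, so the closed form is best evaluated with the complementary error function as done here.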

The relative importance of azimuthal versus radial displacements depends on the magnitude
of $\tau$. From Figure 5, the two effects achieve equal magnitudes for $\Omega\tau \sim .6$. In coastal
areas, we expect outward displacements to be more important, and vice versa for the open
ocean. More precisely, the true ensemble average displacement is $\langle (l + \zeta)^2 \rangle$, which is even
less analytically tractable. But since

$$\langle l^2 \rangle + \langle \zeta^2 \rangle > \langle (l + \zeta)^2 \rangle,$$

calculating them separately gives us an upper bound on average displacement magnitude.

Figure 5: Numerically integrated average squared displacements along with analytically calculated
limits are shown in (a). Also shown are adjustment displacements squared for comparison.
Random walk diffusivities based on such displacements are shown in (b).


4.3  Azimuthal  stretching 

While a dye patch moves around a vortex center, it is sheared by radial gradients in velocity.
The direction of shear is likely to be different in different periods of time $\tau$. Hence, in
calculating stretching we must consider not only the distance to a vortex center but also a
dye streak orientation with respect to a vortex. The increase in length of a small line element
during one time interval $\tau$ is shown in Figure 6 and given by

$$\frac{\delta s^2}{s_0^2} = 1 + \tau r \frac{d\Omega}{dr}\sin(2\phi) + \left(\tau r \frac{d\Omega}{dr}\cos\phi\right)^2.$$

Ensemble averaging must be done over $r$ and $\phi$. We assume that $\phi$ is evenly distributed
between 0 and $2\pi$. After $N$ periods of length $\tau$ have passed, a streak of initial length $s_0$ will
have grown to


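$$\langle s_N^2 \rangle = s_0^2 \left[\left\langle \frac{\delta s^2(r,\phi)}{s_0^2} \right\rangle_{r,\phi}\right]^N$$

$$= s_0^2 \left[1 + \frac{\tau^2}{2}\left\langle \left(r\frac{\partial \Omega}{\partial r}\right)^2 \right\rangle_r\right]^N$$

$$= s_0^2\, e^{-N(r_1/r_2)^2}\left[1 + \Gamma\alpha^2(\Omega_0\tau)^2\right]^N. \qquad (24)$$

Equation (24) can be written in a simple exponential form

$$\langle s_N^2 \rangle = s_0^2\, e^{2\gamma t} \qquad (25)$$

by defining

$$\gamma = \frac{1}{2\tau}\left[\ln\left(1 + \Gamma\alpha^2(\Omega_0\tau)^2\right) - \left(\frac{r_1}{r_2}\right)^2\right]. \qquad (26)$$

Figure 6: Stretching of a line element.

Figure 7: Exponential stretching rate felt over many vortex lifetimes.

One might note the similarity of this to the Lyapunov exponents calculated for the Random
Renovating Wave model developed in the principal lectures. The stretching rate, $\gamma$, is plotted
in Figure 7 as a function of $\Omega\tau$.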
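
The single-interval stretching formula at the start of this subsection can be verified geometrically. The sketch below takes a unit line element at angle $\phi$ to the radial direction, applies a shear that displaces it azimuthally by an amount proportional to its radial extent, and compares the resulting squared length with the formula. The function names and the sign convention for the shear are assumptions made for illustration; `shear_tau` stands for $\tau r\, d\Omega/dr$.

```python
import math


def stretch_direct(phi, shear_tau):
    """Squared length of a sheared unit element, computed from its components."""
    radial = math.cos(phi)
    azimuthal = math.sin(phi) + shear_tau * math.cos(phi)
    return radial * radial + azimuthal * azimuthal


def stretch_formula(phi, shear_tau):
    """Squared length from 1 + S sin(2 phi) + (S cos(phi))^2 with S = shear_tau."""
    return 1.0 + shear_tau * math.sin(2.0 * phi) + (shear_tau * math.cos(phi)) ** 2


def orientation_average(shear_tau, n=3600):
    """Average of the squared length over uniformly distributed orientations."""
    return sum(stretch_formula(2.0 * math.pi * k / n, shear_tau) for k in range(n)) / n


print(stretch_direct(0.7, 0.8), stretch_formula(0.7, 0.8))
print(orientation_average(0.8), 1.0 + 0.8 ** 2 / 2.0)
```

Averaging over orientation removes the $\sin(2\phi)$ term and halves the quadratic term, which is where the factor $\tau^2/2$ multiplying the mean squared shear comes from.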

4.4  Diffusion/Shear  Dispersion 

As a dye patch stretches and moves around a vortex center, its area will increase due to horizontal
diffusivity. Molecular diffusion in a sheared velocity field is enhanced by the interaction
between horizontal or vertical shear and a background horizontal or vertical diffusivity. In
a steady shear field, this interaction leads to an anomalously fast diffusion at large times [4].
In an oscillatory flow field, the anomalous diffusion of steady shear is appropriate only for
time-scales smaller than the oscillation time. Over several periods, diffusion regains a Fickian
character, with an enhanced effective diffusivity which is averaged over the oscillation time
of the anomalous diffusion. Young et al. considered the vertical shear associated with typical
open ocean values of the internal wave field and found that horizontal diffusion was enhanced
over its molecular values by a factor of $N^2/f^2$. Our flow similarly contains a finite time
scale. In our case, shear is steady for the vortex existence time, $\tau$, after which it may change
direction and magnitude. However, unlike the Young et al. model in which the advective
solution returns to its original state at the end of each oscillation, our flow is circular within
each time period. Therefore, Rhines and Young's work on two-dimensional dispersion for
closed streamlines is also helpful [11]. The shear in their case is radial shear of a circular
flow. To get the best shear dispersion estimate for our problem, we start with the Rhines
and Young approach and add a vertical component of shear and time dependence.

Following their approach, we start with the advection diffusion equation for concentration,
$\theta$, in a steady velocity field. For a circular velocity field with azimuthal velocity given by

$$\frac{1}{r}\,u_\phi = \Omega(r, z) \qquad (27)$$

and distinct vertical and horizontal diffusivities, the governing equation is

$$\theta_t + \Omega\,\theta_\phi = K_H\left[\frac{1}{r}(r\theta_r)_r + \frac{1}{r^2}\theta_{\phi\phi}\right] + K_z\theta_{zz}. \qquad (28)$$

If we assume that there is an independent internal wave field with a faster time scale superimposed
on our vortical mode field, a plausible relationship between $K_H$ and $K_z$ is given by [4]

$$K_H = \frac{N^2}{f^2}\,K_z. \qquad (29)$$

We assume the solution can be written as a sum of components with different azimuthal
wavenumbers, each of which is given by

$$\theta = A(r, z, t)\,e^{in\tilde{\phi}} \qquad (30)$$

$$\tilde{\phi} = \phi - \Omega(r, z)\,t. \qquad (31)$$

Plugging (30) into (28) and grouping terms gives an equation for the rate of concentration
change in the framework of the advective solution

$$\begin{aligned}
A_t ={}& K_H\left[\frac{1}{r}(rA_r)_r - \frac{n^2}{r^2}A\right] + K_z A_{zz} \\
& - i\left\{K_H\left[\frac{A}{r}(r\tilde{\Omega}_r)_r + 2A_r\tilde{\Omega}_r\right] + K_z\left[A\tilde{\Omega}_{zz} + 2A_z\tilde{\Omega}_z\right]\right\} \\
& - A\left\{K_H\tilde{\Omega}_r^2 + K_z\tilde{\Omega}_z^2\right\},
\end{aligned} \qquad (32)$$

with

$$\tilde{\Omega} = nt\Omega.$$

Figure 8: Non-dimensional magnitude of vertical and horizontal shear from the full numerical
solutions for an equilibrium vortex velocity field.

The rhs terms of (32) are grouped by powers of $(nt\Omega)$, which is physically related to
distance traveled around the vortex center. Analysis in Section 2 indicated that oceanic
vortical modes can have existence times that vary from much shorter to much longer than
their eddy turnover time such that we are interested in solutions for a range of $\tilde{\Omega}$ values.
The first term in (32) is simply molecular diffusion, and in the limit of $\tilde{\Omega} \ll 1$, this is the
dominant term. In the opposite limit, $\tilde{\Omega} \gg 1$, the third term of (32) dominates. This is the
limit considered by Rhines and Young. Following their analysis, the solution to (32) for large
$\tilde{\Omega}$ becomes

$$A \sim e^{-\frac{1}{3}n^2 t^3\left[K_H\Omega_r^2 + K_z\Omega_z^2\right]}. \qquad (33)$$

For intermediate values of $\tilde{\Omega}$, we need to use the full version of (32). Such precision is
beyond the scope of this paper, and unnecessary to get the sort of magnitude estimates we are
interested in. The above approximations suggest that for vortices with short lifetimes (coastal
regions), vortical shear dispersion is not significant, and we can define an appropriate effective
horizontal diffusivity, $K_s$, which is in this case equal to the traditionally used internal wave
enhanced value given by (29). In areas with long lived vortices (open ocean), this traditional
value is likely to be further enhanced by vortical mode shear dispersion. Roughly following
methods of dealing with oscillatory waves in Rhines and Young and Young et al., we pull
from (33) an effective diffusivity of the form

$$K_s \sim \frac{1}{3}\,n^2\tau^2\left(K_H\Omega_r^2 + K_z\Omega_z^2\right). \qquad (34)$$

Figure 9: Schematic of different growth rates that characterize tracer distribution.

$K_H$ is given by (29) and $K_z$ is perhaps a molecular diffusivity. Radial and vertical shears
from the McWilliams model are shown in Figure 8. The reader will recall that these shears
are non-dimensional and dimensional forms of $\Omega_r^2$ and $\Omega_z^2$ will have a relative scaling factor of
$N^2/f^2$. Hence, the roughly comparable magnitudes shown in Figure 8 indicate comparable
importance of the two terms in (34).
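
The large-$\tilde{\Omega}$ behavior can be illustrated by keeping only the third group of terms in (32), which reduces the amplitude equation to $A_t = -n^2 t^2\left(K_H\Omega_r^2 + K_z\Omega_z^2\right)A$. The sketch below integrates this reduced equation with a fourth-order Runge-Kutta scheme and compares the result with the cubic-in-time exponential decay of (33). The function names and the non-dimensional parameter values are illustrative assumptions, and the shears are treated as constants.

```python
import math


def decay_rate(t, n, kh, kz, omega_r, omega_z):
    """Instantaneous decay rate from the third group of terms in eq. (32)."""
    return n * n * t * t * (kh * omega_r ** 2 + kz * omega_z ** 2)


def amplitude_exact(t, n, kh, kz, omega_r, omega_z):
    """Closed-form amplitude: exp(-(1/3) n^2 t^3 [K_H Omega_r^2 + K_z Omega_z^2])."""
    return math.exp(-(n * n * t ** 3 / 3.0) * (kh * omega_r ** 2 + kz * omega_z ** 2))


def amplitude_rk4(t_end, n, kh, kz, omega_r, omega_z, steps=2000):
    """Integrate dA/dt = -decay_rate(t) * A from A(0) = 1 with classical RK4."""
    dt = t_end / steps
    a, t = 1.0, 0.0
    args = (n, kh, kz, omega_r, omega_z)
    for _ in range(steps):
        k1 = -decay_rate(t, *args) * a
        k2 = -decay_rate(t + 0.5 * dt, *args) * (a + 0.5 * dt * k1)
        k3 = -decay_rate(t + 0.5 * dt, *args) * (a + 0.5 * dt * k2)
        k4 = -decay_rate(t + dt, *args) * (a + dt * k3)
        a += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        t += dt
    return a


# Illustrative non-dimensional values (assumed, not taken from Figure 8).
params = dict(n=1, kh=1.0e-3, kz=1.0e-3, omega_r=0.1, omega_z=0.1)
print(amplitude_rk4(50.0, **params), amplitude_exact(50.0, **params))
```

The $t^3$ dependence in the exponent is what makes shear dispersion negligible for short-lived vortices and increasingly important for long-lived ones.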

All  of  the  abovementioned  effects  (stretching,  shear  dispersion,  net  displacement)  occur 
at  the  same  time  for  a  dye  patch  feeling  a  single  vortex  field.  To  gain  some  insight  into  how 
these  various  effects  fit  together  into  a  more  coherent  picture  of  diffusion,  we  now  turn  to 
longer  time  scales. 

5  Longer  Times 

During longer time periods, a dye patch will experience the effects of multiple vortices, both
because we are considering times greater than an individual vortex lifetime and because over
long times a patch grows spatially to feel many vortices. As a dye patch is stretched by
several randomly oriented vortices in turn, it stretches and folds as illustrated in the cartoon
in Figure 9. There are now two types of diffusion of interest. First, one might like to know the
size of the bounding circle, designated as $A_p$ in Figure 9. This represents the area in which
one has a chance of encountering concentrate. As suggested by Garrett, this area grows as
the separation between discrete parcels in the flow field [3]. The second quantity of interest is
the actual area occupied by dye, $A_t$, which will depend on the stretching rate $\gamma$ from equation
(26). We now approach both these quantities more specifically for our RRV flow field.

5.1  Multiparticle  Dispersion

The  bounding  area,  Ap,  will  roughly  grow  like  the  size  of  a  group  of  discrete  water  parcels 
initially  close  together.  Qualitatively,  the  dispersion  rate  of  a  collection  of  particles  is  a 
function of how close they are to each other. A group of nearby particles will all be feeling a
similar  velocity  field  and  will  move  more  or  less  together,  dispersed  only  by  the  small  velocity 
differences  between  their  positions.  As  they  move  apart,  they  feel  a  larger  velocity  difference 
and  move  apart  more  quickly.  In  our  case  there  comes  a  point  when  they  are  so  far  separated 
that  (on  average)  they  no  longer  feel  the  same  vortex,  at  which  point  their  velocities  become 
completely  uncorrelated.  At  this  point  each  particle  moves  in  its  own  random  walk  and  the 
particle  separation  distance  should  increase  as  twice  that  of  a  single  random  walker. 

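Quantitatively, the area of a group of particles is described by the second moment,

$$\sigma^2 = \frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2 = \frac{1}{n}\sum_{i=1}^{n}x_i^2 - \bar{x}^2, \qquad (35)$$

with

$$\bar{x} = \frac{1}{n}\sum_{i=1}^{n}x_i.$$

Parcel position $x_i$ at any time $t$ can be obtained by integrating the Lagrangian velocity field

$$x_i = x_{i0} + \int_0^t u_i(t')\,dt'. \qquad (36)$$

Plugging (36) into (35) gives

$$\sigma^2 = \sigma_0^2 + \frac{1}{n}\sum_{i=1}^{n}\left(\int_0^t u_i(t')\,dt'\right)^2 - \frac{1}{n^2}\sum_{i=1}^{n}\sum_{j=1}^{n}\int_0^t u_i(t')\,dt'\int_0^t u_j(t')\,dt'. \qquad (37)$$
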


For our model, $t = N\tau$, where $N$ is the number of vortex lifetimes experienced. The net
displacement, $x_i(t)$, is simply the sum of displacement from each successive vortex

$$\int_0^t u_i(t')\,dt' = \sum_{k=1}^{N}\int_0^\tau u_i^{(k)}(t')\,dt'.$$

Next, we simplify by ensemble averaging (37). Consider how variance increases on average
during one period

$$\langle \sigma_N^2 \rangle - \langle \sigma_{N-1}^2 \rangle = \left\langle \frac{1}{n}\sum_{i=1}^{n}\left(\int_0^\tau u_i(t')\,dt'\right)^2 \right\rangle - \left\langle \frac{1}{n^2}\sum_{i=1}^{n}\sum_{j=1}^{n}\int_0^\tau u_i(t')\,dt'\int_0^\tau u_j(t')\,dt' \right\rangle,$$

which can be re-written as

$$\langle \sigma_N^2 \rangle - \langle \sigma_{N-1}^2 \rangle = \frac{1}{n}\sum_{i=1}^{n}\langle l_i^2 \rangle - \frac{1}{n^2}\sum_{i=1}^{n}\sum_{j=1}^{n}\langle l_i l_j \rangle. \qquad (38)$$

Here $\langle l^2 \rangle$, from (19), is the same for each parcel and $\langle l_i l_j \rangle$ is a measure of correlation between
a pair of parcels and must be summed over all pair combinations. Physically, (38) states that
the increase in particle area goes like the single particle diffusivity with a correction term
that accounts for correlation between particles. If all particles are moving with the same,
perfectly correlated velocity, $\langle l_i l_j \rangle = \langle l^2 \rangle$, then no relative dispersion occurs. In our flow, the
degree of correlation is a function of the particle separation, $\langle l_i l_j \rangle = \mathcal{F}[\langle \sigma_{N-1}^2 \rangle]$.


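To make this calculation more tractable, consider the evolution of distance between only two
particles. Let $\xi = x_2 - x_1$ be the distance between two parcels. The ensemble averaged
increase in $\langle\xi^2\rangle$ during any single time period is given by

$$\langle\xi_N^2\rangle - \langle\xi_{N-1}^2\rangle = 2\langle l^2\rangle\left[1 - \frac{\langle l_i l_j\rangle}{\langle l^2\rangle}\right]. \qquad (39)$$

For two parcels a distance $\xi$ apart, $\langle l_i l_j\rangle$ is given by

$$\langle l_i l_j\rangle = \int_0^{\infty}\int_0^{2\pi} P(r)\,2r(r + \delta r)\sqrt{[1 - \cos(\Omega(r)\tau)][1 - \cos(\Omega(r + \delta r)\tau)]}\;d\phi\,dr, \qquad (40)$$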

with 


Sr  = 


—  cos(</>)  +  r2. 


We define a spatial correlation function

$$\mathcal{R}_{ij} = \frac{\langle l_i l_j\rangle}{\langle l^2\rangle}. \qquad (41)$$


As is, this representation assumes that as particles grow further apart their velocities become
less correlated but they still feel different parts of the same vortex field. It is more realistic to
say that significantly separated particles will feel the effects of uncorrelated vortices. From
Section 2, the lengthscale of vortex influence is $r_2$. We manually impose this constraint
by defining a new correlation function in which particles more than $r_2$ apart feel different
vortices

$$\mathcal{R}'(\xi) = \mathcal{R}_{ij}(\xi)\,f(\xi), \qquad (42)$$

where $f(\xi)$ is some function that goes from 1 at $r = 0$ to 0 for large $r$. There are several
forms $f(\xi)$ can take. We could impose that correlation abruptly goes to 0 when the distance
exceeds $r_2$, so $f(\xi)$ takes the form of a Heaviside step function

$$f(\xi) = \begin{cases} 1, & r < r_2 \\ 0, & r > r_2. \end{cases} \qquad (43)$$

Figure 10: Two different possibilities for multiparticle spatial correlation.


This  is  straightforward  but  a  little  abrupt.  Another  possibility  is  to  make  the  transition  very 
smooth  by  assuming  that  the  probability  of  nearby  particles  being  in  different  vortices  goes 
from  0  to  1  linearly: 


MQ  =  1  -  j|-  (44) 

Both possibilities for $\mathcal{R}'$ are plotted in Figure 10, where $\mathcal{R}_{ij}$ has been numerically
calculated from (19) and (40) for different values of $\xi$. Reality is likely to be somewhere in
between. The second choice has the advantage of a continuous derivative. It also provides
an approximate lower bound for parcel spatial correlation and hence an upper bound for
diffusion rate. We will use the form in (44) from now on.

The real quantity of interest is the time evolution of $\xi$ for particles initially separated by a
distance $\xi_0$. We numerically integrate (39) for different values of $\xi_0$ and $\tau$ and plot the results
in Figure 11. For reference, we also plot the separation growth expected for uncorrelated
particles based on (19) alone, assuming $\langle l_i l_j\rangle = 0$. At large times, particle separation goes as
twice the single particle dispersion rate, $\langle l^2\rangle$, as expected. For a given value of $\tau$, the initial
particle separation, $\xi_0$, controls the time it takes to settle into linear growth, but not the
qualitative nature of the transition or any aspect of the large time solution. For different
values of $\tau$, several things change. First, the final diffusivity (the intercept of our log-log
plot) is different, as expected from the relationship in Figure 5. Second, the transition to
linear growth has a different character. For $\tau = 1, 10$, the initial growth in $\langle\xi^2\rangle$ is slower than
linear, while for $\tau = 100, 1000$ it is faster than linear.
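The iteration of (39) can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the correlation is replaced by a hypothetical stand-in that falls linearly from 1 at zero separation to 0 at a cutoff `r2`, the mean separation is approximated by the square root of the mean squared separation, and the parameter values are arbitrary rather than those used for Figure 11.

```python
def correlation(xi, r2):
    """Hypothetical stand-in for the pair correlation: 1 at xi = 0, 0 beyond r2."""
    return max(0.0, 1.0 - xi / r2)


def pair_separation(xi0, l2, r2, n_steps):
    """Iterate <xi_N^2> = <xi_{N-1}^2> + 2 <l^2> [1 - R(xi)], as in eq. (39).

    Returns the list of squared separations after 0, 1, ..., n_steps periods.
    """
    xi2 = xi0 ** 2
    history = [xi2]
    for _ in range(n_steps):
        xi = xi2 ** 0.5  # closure assumption: use the r.m.s. separation
        xi2 += 2.0 * l2 * (1.0 - correlation(xi, r2))
        history.append(xi2)
    return history


# Arbitrary illustrative values (not taken from the report).
hist = pair_separation(xi0=0.1, l2=0.01, r2=1.0, n_steps=2000)

if __name__ == "__main__":
    print("first increment:", hist[1] - hist[0])
    print("late increment: ", hist[-1] - hist[-2])
```

With these values the early increments are small because the pair is strongly correlated, and once the separation exceeds the cutoff each period adds exactly $2\langle l^2\rangle$, which is the uncorrelated random-walk limit described in the text.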

Figure 11: Squared pair separation distance obtained by iterative integration. $\tau$ ranges from
1 to 1000. In all subplots the solid lines represent $\xi_0$ values of 0.1, 0.5 and 1. The dotted line is
the growth rate expected for uncorrelated randomly walking particles.

5.2 Streak Area


The  other  way  to  characterize  the  growth  of  a  dye  streak  is  by  the  actual  area  of  the  tracer. 
As  described  by  Ledwell  et  al.  and  Sundermeyer,  the  streak  will  go  through  an  initial  period 
of  stretching  during  which  it  gets  thinner,  cross-streak  gradients  get  larger  and  cross-streak 
diffusive  fluxes  grow.  Eventually  a  balance  between  stretching  and  diffusion  is  reached,  at 
which  point  the  streak  continues  to  grow  larger,  but  its  width  remains  the  same.  This 
equilibrium width is related to the stretching rate $\gamma$ (26), by


where  Ks  is  an  appropriate  horizontal  diffusivity  (section  4.4). 

After  a  width  is  established,  the  streak  area  grows  with  its  increasing  length 


(45) 


(46) 


for some initial length $l_0$.

5.3  The  Big  Picture 

During the initial period of stretching and folding of a dye patch, the containing area $A_p$ will
be significantly larger than the actual dye area $A_t$. However, tracer area grows exponentially,
(46), and will catch up. Physically, as the tracer area depicted in Figure 9 grows, different
parts of the tracer streak get close enough together that they start to merge and the tracer
fills in the area bounded by $A_p$. From then on, the dye patch is fairly homogeneous, and
continues to grow in a Fickian manner with $A_p$. In Figure 12, we plot $A_p(t)$ and $A_t(t)$ together
for $\gamma = 0.008$, $\tau = 100$, $\xi_0 = 0.1$. We have assumed that both quantities start with the same
initial area,


Aq  =  lo 


In  this  case,  tracer  area  is  much  smaller  than  the  bounding  area  up  until  a  time  Oro  ~  70, 
after  which  tracer  area  grows  linearly  as  Ap. 


6  Ocean  comparisons  and  Conclusions 

Our goal in this work has been to evaluate whether sub-mesoscale eddies could make a
significant contribution to horizontal diffusion in the coastal or open ocean. Because it was
not immediately apparent which of the potential stirring and mixing abilities of vortical modes
would be important, we undertook a step by step look at the time dependent diffusion of an
initially small patch of passive tracer. To get a realistic form for a single vortical mode, we
looked at numerical solutions to the non-linear adjustment of an isolated density anomaly,
and incorporated an idealized version of these results into a simple stochastic renovating
vortex model. Most of the qualitative features of tracer evolution we discuss are
the same as those described by Garrett for a two-dimensional turbulent field. Using our
analytical model, we are able to calculate exact values for some of the stages he talks about
more qualitatively. The picture that emerges is that at space and time scales of the same order
of magnitude as vortex scales and lifetimes, a patch of dye twists and folds with exponentially
increasing area, filling a larger bounding area governed by the multiparticle dispersion rate.
At larger space and time scales, the tracer area grows in a Fickian manner with the effect of
vortical modes incorporated into an effective diffusivity.

Figure 12: Growth rates for multiparticle dispersion and tracer area. They merge

In a realistic ocean, the added complexities of stirring processes operating on a variety
of time and space scales will prohibit comparison of the distinct stages of diffusion we have
discussed. The most practical results to emerge from our work are estimates of an effective
(Fickian) diffusivity, and bounds on when such a diffusivity is appropriate. In the coastal
ocean, vortex lifetimes are likely to be shorter than their turnover timescale. In this limit, the
displacements, stretching distortion and shear dispersion associated with azimuthal motion
may not be as important as displacements from the initial vortex adjustment. We calculate
a non-dimensional effective diffusivity for this random-walk-like motion of 0.01 (Figure 5). To
redimensionalize, we multiply by $(N^2 h_*^2 f\beta)/f^2$. Using $N = 10$ cph, $f = 1$ day$^{-1}$, $\beta = 0.25$, and
$h_* = 3$-$10$ m, we get dimensional diffusivities of up to 0.1 m$^2$/s. While this is an enhancement
over the estimates from shear dispersion alone, it is still an order of magnitude smaller than
observations. In the open ocean, vortices may exist for many rotations, in which case the
mechanisms of Section 5 are more appropriate. Using $h_*$ up to 30 m, we calculate dimensional
diffusivities of order 1 m$^2$/s, the same order as observations. This value will be appropriate
only after a dye patch has grown to scales larger than the vortex scale (approximately an
internal Rossby radius). Our analysis suggests the time necessary to get there is highly
dependent on the initial dye patch size and the vortex lifetime (Figure 11).

There are numerous additional complexities that could be included in future work. First,
given the importance of the vortex lifetime, $\tau$, in our results, we should try to get better
estimates of realistic ocean values. We should consider the potential effect of instabilities as a
limit on expected lifetime. Second, we could consider a model with a continuous distribution
of vortex lifetimes. Third, we could use the full baroclinic vortex form suggested by the
numerical adjustment results. Such an approach would include the effects of cyclonically
rotating vortices, which could significantly enhance stirring motions. Finally, we could compare
our analytical results with full numerical simulations, such as those being performed by P.
Lelong [12].

Stationary vortices in a Keplerian shear flow


P.  Garaud 


1  Introduction 

1.1  Planetary  formation 

Science in its most general definition began as a quest to answer the fundamental questions
on the origin of humanity and its relation to the surrounding Universe. One of the keys
to understanding the origin of Life is the mechanism of formation of our own Solar system,
and especially the formation of planets. This subject has gained a new interest in the past
few years with the discovery of giant planets orbiting some of the nearest neighbouring stars.
The generally accepted theory of planet formation consists of the following steps:

•  Due to the onset of a large scale gravitational instability, the core of a dense molecular
cloud collapses into a protostar; the conservation of its initial angular momentum
results  in  the  gradual  flattening  of  the  collapsing  gas  into  an  accretion  disk  around  the 
protostar. 

•  The  gas  in  the  accretion  disk  has  two  components:  a  molecular  gas,  composed  mainly 
of  H2  and  other  small  molecules,  and  a  dust  gas,  composed  of  particles  of  sizes  ranging 
from  a  few  microns  to  a  few  centimeters.  The  interaction  between  these  components 
takes  place  mainly  via  Stokes  drag.  The  vertical  stratification  in  the  accretion  disk 
relies  on  the  balance  between  pressure  and  the  vertical  component  of  the  gravitational 
force.  As  a  result,  since  the  thermal  pressure  of  the  dust  gas  is  much  smaller  than  that 
of  the  molecular  gas,  the  dust  settles  into  a  very  thin  disk  within  the  accretion  disk. 

•  The  dust  particles  then  coalesce  into  larger  and  larger  grains,  up  to  sizes  of  a  few 
kilometers;  as  they  grow  in  mass,  the  dynamics  of  these  “planetesimals”  gradually 
decouple  from  that  of  the  molecular  gas. 

•  The  planetesimals  continue  aggregating  into  planets.  Giant  planets  may  accrete  some 
of  the  molecular  gas  still  left  in  the  accretion  disk. 

However, although the dust aggregation into larger grains is known to take place, the
exact mechanism is poorly understood. The time-scale for this aggregation process has an
upper limit of a few Myr ($10^6$ yr) set by the evolution of the protostar into a T-Tauri star.
Indeed, T-Tauri stars are observed to have intense magnetic activity and strong stellar winds
which scatter all non-gravitationally bound dust and gas into the interstellar space. Random
encounters of the dust particles due to thermal agitation are not sufficient to account for the
growth of the dust grains into planetesimals within the T-Tauri evolution time-scale.

In an attempt to remedy this problem, it has been shown in the case of two-dimensional
barotropic  turbulence  in  a  rotating  fluid  that  dust  particles  may  migrate  to  the  center  of 
anticyclonic  vortices  [1].  They  concentrate  there  for  the  lifetime  of  the  vortex.  As  a  result, 
provided  the  vortices  are  long  lived,  it  is  possible  to  greatly  increase  the  aggregation  rate,  and 
reach  the  required  sizes  of  dust  grains  before  the  T-Tauri  phase.  However,  it  is  not  yet  clear 
whether  a  Keplerian  flow  can  undergo  self-sustained  turbulence.  Indeed,  from  the  Rayleigh 
inflexion  theorem,  we  see  that  the  accretion  flow  is  stable  to  linear  shear  instability,  and  the 
latest  numerical  simulations  seem  to  indicate  that  the  primordial  solar  nebula  may  be  stable 
to  nonlinear  hydrodynamic  instabilities  too  [2].  It  has  been  shown  that  even  a  very  small 
magnetic  field  may  trigger  some  linear  instability  [3],  but  in  this  case  it  is  not  clear  how  the 
magnetic  forces  would  influence  the  existence  or  stability  of  the  vortices. 

Although  the  problem  of  hydrodynamical  stability  of  the  accretion  flows  is  not  yet  fully 
understood, there has been evidence in two-dimensional decaying turbulence for the spontaneous
appearance and the persistence of vortices on time-scales much larger than the turnover
time-scale.  There  is  therefore  hope  in  the  Keplerian  case  that  even  if  the  turbulence  is  not 
self-sustained,  the  initial  anisotropies  in  the  flow  are  large  enough  to  create  these  long-lived 
vortex  structures.  The  work  presented  in  this  report  is  an  attempt  at  finding  steady  state 
solutions  for  vortices  in  Keplerian  accretion  flows.  If  these  solutions  exist  and  are  found  to  be 
stable,  they  would  explain  the  persistence  of  the  vortices,  and  therefore  solve  the  remaining 
dust  aggregation  problem. 


1.2  Mathematical  setup 

We will always take $\mathbf{u}$ to be the velocity field, $\psi$ the corresponding stream function and $\omega$
the potential vorticity. In the work presented here, we have chosen to simplify the problem
greatly by considering only 2-dimensional, incompressible fluid motion. As a result of this
approximation, we can now write

$$\mathbf{u} = -\nabla\times(\psi\,\mathbf{e}_z) = \mathbf{e}_z\times\nabla\psi \quad\text{and}\quad \boldsymbol{\omega} = \omega\,\mathbf{e}_z = \nabla^2\psi\,\mathbf{e}_z. \qquad (1)$$


We will consider the vortex to be a perturbation on the main Keplerian accretion flow.
The unperturbed shear flow $\mathbf{u}_K$ is given by the Keplerian rotation law, which describes the
equilibrium between the centrifugal and gravitational forces:

$$\mathbf{u}_K = \sqrt{\frac{GM}{R}}\,\mathbf{e}_\theta, \qquad (2)$$

where $R$ is the distance from the central accreting object of mass $M$ (and $G$ is the gravitational
constant). The corresponding vorticity is $\omega_K = \frac{1}{2}\sqrt{GM/R^3}$. The vortex studied will be placed
at a distance $R_0$ from the center, at $\theta = 0$. In the following work, we will often have to change
from the polar coordinate system around the central mass, $(R, \theta)$, to that around the vortex,
$(r, \varphi)$. We chose to take $\varphi = 0$ where $\theta = 0$; this change of coordinate is represented in Fig. 1.

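The perturbed vorticity and flow are represented by primed quantities. The equation for
the evolution of the vorticity perturbation is

$$\frac{\partial\omega}{\partial t} + \mathbf{u}\cdot\nabla\omega = \frac{\partial\omega'}{\partial t} + \mathbf{u}_K\cdot\nabla\omega' + \mathbf{u}'\cdot\nabla\omega_K + \mathbf{u}'\cdot\nabla\omega' = 0. \qquad (3)$$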

Figure 1: Change of coordinate


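This can be rewritten in cylindrical coordinates around the vortex patch as

$$\frac{\partial\omega'}{\partial t} + \frac{v_K}{r}\frac{\partial\omega'}{\partial\varphi} - \frac{1}{r}\frac{\partial\omega_K}{\partial r}\frac{\partial\psi'}{\partial\varphi} + \frac{1}{r}\frac{\partial\psi'}{\partial r}\frac{\partial\omega'}{\partial\varphi} - \frac{1}{r}\frac{\partial\omega'}{\partial r}\frac{\partial\psi'}{\partial\varphi} = 0. \qquad (4)$$

The vorticity perturbation and the perturbed stream function are related by

$$\omega' = \nabla^2\psi'. \qquad (5)$$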


1.3  Dimensionless  quantities 

In order to simplify the expressions, we will now introduce the following new units system:

$$M = 1, \quad R_0 = 1, \quad\text{and}\quad T_0 = 1, \qquad (6)$$

where $T_0$ is the revolution time around the central object at radius $R_0$, namely $T_0 = 2\pi\sqrt{R_0^3/(GM)}$.
As a result, the Keplerian velocity becomes $u_K = 2\pi R^{-1/2}$.


1.4  Change  of  reference  frame 

We will be looking for vortex solutions where the vortex is rotating around the central star
with a Keplerian velocity. Steady state solutions then only have a meaning when taken in
the rotating coordinate frame. We use a frame of reference rotating with the velocity of
the center of the vortex patch. The relative shear around this point is given by

$$v_K(R) = 2\pi\left(R^{-1/2} - R\right). \qquad (7)$$

The corresponding stream function is

$$\psi_K(R) = 2\pi\left(2(R^{1/2} - 1) - \tfrac{1}{2}(R^2 - 1)\right). \qquad (8)$$

Without loss of generality, we have chosen $\psi_K(1) = 0$. The Keplerian vorticity and its
gradient are given by

$$\omega_K = \pi R^{-3/2} - 4\pi, \quad\text{and}\quad \frac{d\omega_K}{dR} = -\frac{3}{2}\pi R^{-5/2}. \qquad (9)$$
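The relations (7)-(9) can be cross-checked numerically. The Python sketch below assumes the dimensionless forms $v_K = 2\pi(R^{-1/2} - R)$, $\psi_K = 2\pi(2(R^{1/2}-1) - \frac{1}{2}(R^2-1))$ and $\omega_K = \pi R^{-3/2} - 4\pi$, and uses central finite differences to confirm that $d\psi_K/dR = v_K$ and $(1/R)\,d(R v_K)/dR = \omega_K$. The function names and the step size are illustrative choices.

```python
import math

TWO_PI = 2.0 * math.pi


def v_k(R):
    """Azimuthal Keplerian velocity seen in the rotating frame, eq. (7)."""
    return TWO_PI * (R ** -0.5 - R)


def psi_k(R):
    """Stream function of eq. (8), normalised so that psi_k(1) = 0."""
    return TWO_PI * (2.0 * (math.sqrt(R) - 1.0) - 0.5 * (R * R - 1.0))


def omega_k(R):
    """Keplerian vorticity in the rotating frame, eq. (9)."""
    return math.pi * R ** -1.5 - 4.0 * math.pi


def derivative(f, x, h=1e-5):
    """Central finite-difference estimate of f'(x)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


if __name__ == "__main__":
    R = 1.3
    print(derivative(psi_k, R), v_k(R))
    print(derivative(lambda x: x * v_k(x), R) / R, omega_k(R))
```

Both printed pairs should agree to many digits, and both $v_K$ and $\psi_K$ vanish at $R = 1$, the center of the vortex patch.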

2 Top hat vortices in a Keplerian shear

In this section we will use an approximation which consists in neglecting the background
Keplerian vorticity gradient in the vorticity equation. This approximation is valid primarily
for small vortices. In this case, there exists a solution of the steady state problem with $\omega'$
piecewise constant. We will therefore try to find solutions of the type

$$\omega' = \nabla^2\psi' = \begin{cases} q & \text{inside the vortex patch} \\ 0 & \text{outside the vortex patch.} \end{cases} \qquad (10)$$

Equation (10) can now be rewritten as

$$\nabla^2\psi' = q\,H(a + \eta(\varphi) - r), \qquad (11)$$

where $a$ is the average radius of the patch, and $\eta$ is the departure from that average. We will
linearize this equation by considering $\eta \ll a$, so that

$$\nabla^2\psi' = q\,H(a - r) + q\,\eta(\varphi)\,\delta(a - r) + O(\eta^2). \qquad (12)$$
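The linearization leading to (12) replaces $H(a + \eta - r)$ by $H(a - r) + \eta\,\delta(a - r)$. One way to see that the error is of second order in $\eta$ is to integrate both expressions against a smooth test function. The Python sketch below does this with the arbitrary choice $g(r) = r^2$ (an assumption made purely for illustration) and shows that halving $\eta$ reduces the mismatch by roughly a factor of four.

```python
def exact_integral(G, a, eta):
    """Integral over r >= 0 of g(r) H(a + eta - r), written via the antiderivative G."""
    return G(a + eta) - G(0.0)


def linearised_integral(g, G, a, eta):
    """Same integral with H(a + eta - r) replaced by H(a - r) + eta * delta(a - r)."""
    return G(a) - G(0.0) + eta * g(a)


def g(r):
    """Arbitrary smooth test function (illustrative choice)."""
    return r ** 2


def G(r):
    """Antiderivative of g."""
    return r ** 3 / 3.0


def error(a, eta):
    """Mismatch between the exact and the linearised integrals."""
    return abs(exact_integral(G, a, eta) - linearised_integral(g, G, a, eta))


if __name__ == "__main__":
    print(error(1.0, 0.01), error(1.0, 0.005))
```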


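Replacing $\omega'$ in equation (4) by this ansatz, we get the contour dynamics equation
(provided we neglect the term involving the vorticity gradient)

$$\left[\frac{\partial\eta}{\partial t} + \frac{v_K(r)}{r}\frac{\partial\eta}{\partial\varphi} + \frac{1}{r}\frac{\partial\psi'}{\partial r}\frac{\partial\eta}{\partial\varphi} + \frac{1}{r}\frac{\partial\psi'}{\partial\varphi}\right](\delta + \eta\,\delta') + O(\eta^2) = 0, \qquad (13)$$

where $\delta = \delta(a - r)$. Taking the steady state part of this equation, we integrate it once across
the boundary $r = a$ to get

$$\left(\frac{\partial\psi_K}{\partial r} + \frac{\partial\psi'}{\partial r}\right)_{r=a}\frac{\partial\eta}{\partial\varphi} + \frac{\partial\psi'}{\partial\varphi} + O(\eta^2) = 0. \qquad (14)$$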


The condition for no fluid to enter or leave the vortex (which defines it as a localized vortex
patch) is that the total stream function should be constant along the boundary

$$\frac{d}{d\varphi}\left(\psi_K + \psi'\right)\Big|_{r=a+\eta} + O(\eta^2) = 0, \qquad (15)$$

so that if we integrate equation (14) along the boundary, we get

$$\psi_K(a) + \psi'(a) + \eta\,\frac{\partial}{\partial r}\left(\psi_K + \psi'\right)\Big|_{r=a} = \psi_\eta, \qquad (16)$$

where $\psi_\eta$ is a constant. Since this implies that the velocity field is everywhere parallel to the
boundary, there is no net force exerted on the vortex.

2.1 Solution to zeroth order

Let $\psi_c$ be a solution of the zeroth order in $\eta$ of equation (12):

$$\nabla^2\psi_c = q\,H(a - r). \qquad (17)$$

We can integrate this on either side of $r = a$, which yields the solution

$$\psi_c(r) = \begin{cases} \dfrac{q r^2}{4} + c_1\ln r + c_2 & \text{for } r < a \\ c_3\ln r + c_4 & \text{for } r > a. \end{cases} \qquad (18)$$

Regularity at the origin requires that $c_1 = 0$, and we can choose $c_4 = 0$. Note that the stream
function diverges at infinity, but the velocity field is well behaved. Matching the function
and its derivative at $r = a$ yields

$$\frac{q a^2}{4} + c_2 = c_3\ln a, \qquad \frac{q a}{2} = \frac{c_3}{a}, \qquad (19)$$

so that finally, we have

$$\psi_c(r) = \left[\frac{q r^2}{4} + \frac{q a^2}{4}\left(2\ln a - 1\right)\right]H(a - r) + \frac{q a^2}{2}\,H(r - a)\ln r. \qquad (20)$$
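The zeroth-order solution can be checked numerically. The sketch below assumes the form $\psi_c = qr^2/4 + (qa^2/4)(2\ln a - 1)$ inside the patch and $\psi_c = (qa^2/2)\ln r$ outside (that is, $c_1 = c_4 = 0$ with $c_2$, $c_3$ fixed by the matching conditions), and verifies with finite differences that the axisymmetric Laplacian equals $q$ inside and $0$ outside, and that $\psi_c$ is continuous at $r = a$. The values of $q$ and $a$ are arbitrary.

```python
import math


def psi_c(r, q, a):
    """Zeroth-order stream function of a circular top-hat patch (c1 = c4 = 0)."""
    if r < a:
        return q * r * r / 4.0 + q * a * a / 4.0 * (2.0 * math.log(a) - 1.0)
    return q * a * a / 2.0 * math.log(r)


def laplacian_radial(f, r, h=1e-4):
    """Axisymmetric Laplacian f'' + f'/r estimated by central differences."""
    d1 = (f(r + h) - f(r - h)) / (2.0 * h)
    d2 = (f(r + h) - 2.0 * f(r) + f(r - h)) / (h * h)
    return d2 + d1 / r


# Arbitrary illustrative parameters.
Q, A = 1.5, 0.2


def profile(r):
    """psi_c for the illustrative parameters above."""
    return psi_c(r, Q, A)


if __name__ == "__main__":
    print("inside :", laplacian_radial(profile, 0.1))   # close to Q
    print("outside:", laplacian_radial(profile, 0.5))   # close to 0
```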

2.2  Solution  to  first  order 

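If we define $\psi' = \psi_c + \tilde\psi$, subtracting equation (17) from equation (12) yields

$$\nabla^2\tilde\psi = q\,\eta(\varphi)\,\delta(a - r). \qquad (21)$$

Write $\tilde\psi = \sum_n \tilde\psi_n(r)\,e^{in\varphi}$ and $\eta = \sum_n \eta_n e^{in\varphi}$; then equation (21) becomes

$$\frac{d^2\tilde\psi_n}{dr^2} + \frac{1}{r}\frac{d\tilde\psi_n}{dr} - \frac{n^2}{r^2}\tilde\psi_n = q\,\eta_n\,\delta(a - r), \qquad (22)$$

so the solution will be of the kind

$$\tilde\psi_n(r < a) = a_1 (r/a)^{|n|}, \qquad \tilde\psi_n(r > a) = a_2 (r/a)^{-|n|}, \qquad (23)$$

provided $n \neq 0$; we have implicitly imposed regularity of the solutions at the origin and at
infinity. Matching the solution at $r = a$ requires that $a_1 = a_2$. Finally, integrating equation
(21) across the discontinuity, we get

$$\left[\frac{d\tilde\psi_n}{dr}\right]_{a^-}^{a^+} = q\,\eta_n. \qquad (24)$$

For $n \neq 0$, this condition yields $a_1 = a_2 = -q a \eta_n/(2|n|)$. We therefore get

$$\tilde\psi = -\sum_{n\neq 0}\frac{q a \eta_n}{2|n|}\left[(r/a)^{|n|} H(a - r) + (r/a)^{-|n|} H(r - a)\right]e^{in\varphi}. \qquad (25)$$

The case of $n = 0$ is discussed in the next section.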
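The amplitude of each Fourier mode of the first-order correction can be checked against the jump condition. The Python sketch below assumes a radial profile of amplitude $-qa\eta_n/(2|n|)$ multiplying $(r/a)^{|n|}$ inside and $(r/a)^{-|n|}$ outside, and estimates the jump in its radial derivative across $r = a$ with one-sided differences; under that assumption the jump should equal $q\eta_n$. Function names and parameter values are illustrative.

```python
def psi_tilde_n(r, n, eta_n, q, a):
    """Radial profile of the n-th mode (n != 0) of the first-order correction."""
    amp = -q * a * eta_n / (2.0 * abs(n))
    if r < a:
        return amp * (r / a) ** abs(n)
    return amp * (r / a) ** (-abs(n))


def derivative_jump(n, eta_n, q, a, h=1e-6):
    """Radial derivative just outside r = a minus the one just inside."""
    outside = (psi_tilde_n(a + 2 * h, n, eta_n, q, a)
               - psi_tilde_n(a + h, n, eta_n, q, a)) / h
    inside = (psi_tilde_n(a - h, n, eta_n, q, a)
              - psi_tilde_n(a - 2 * h, n, eta_n, q, a)) / h
    return outside - inside


if __name__ == "__main__":
    q, a, eta_n, n = 2.0, 0.1, 0.01, 3  # arbitrary illustrative values
    print(derivative_jump(n, eta_n, q, a), q * eta_n)
```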

2.3 Matching the vortex patch to the Keplerian flow.

The function $\eta$ is given by equation (16), which corresponds to the requirement that the
shape of the vortex patch remains steady. We rewrite it here

$$\psi_K(a) + \eta\,\frac{\partial\psi_K}{\partial r}(a) + \psi_c(a) + \eta\,\frac{\partial\psi_c}{\partial r}(a) + \tilde\psi(a) = \psi_\eta + O(\eta^2). \qquad (26)$$

The $\tilde\psi$ term, as we saw, is of first order in $\eta$. This equation provides all the $\eta_n$'s but $\eta_0$.
A last condition arises from the normalization of the total vorticity of the patch, which is
equivalent to fixing the area of the patch. If we require that the area be $A = \pi a^2$ (i.e. the
area of a corresponding circular patch), then we have

$$A = \int_0^{2\pi}\!\!\int_0^{a+\eta(\varphi)} r\,dr\,d\varphi = \int_0^{2\pi}\frac{1}{2}\Big(a + \sum_n \eta_n e^{in\varphi}\Big)^2 d\varphi = \pi(a + \eta_0)^2 + O(\eta^2). \qquad (27)$$

The normalization condition on $A$ is therefore $\eta_0 = 0$, and is valid to first order in $\eta$. The
only terms left to evaluate are $\psi_K(a)$ and $\eta\,\frac{\partial\psi_K}{\partial r}(a)$.


2.3.1  The  linear  shear  case 


Before starting on the Keplerian shear flow case, let's treat the simple linear shear case; in
any case, one would expect that the results of the linear shear case are recovered in the limit
where the size of the vortex patch $a$ is much smaller than the distance $R$ of the patch to the
center of the Keplerian shear flow.

A linear shear is given by $u_L(R) = s(R - 1)$ (taking the velocity to be 0 at the position
of the center of the vortex patch), so that the corresponding stream function is
$\psi_L(R) = \frac{s}{2}(R - 1)^2$. Since, to a first approximation, $R = 1 + r\cos\varphi + O(r^2)$, we have $\psi_L(r) =$


=  ^-{e2v4>  —  2  +  e  2lip).  The  matching  condition  then  yields 


^ (e2^  -  2  +  e~2iv)  +  (  r)neinifi  ]  ^(e2^  -  2  +  e~2^) 


sa , 

I 


(28) 


,  9°  1  ,|  imp  \  9°  97?nG 

+  _lna+|^,„e  -M  _  -  ^  , 

in^O  /  n^O 


tiTup 


i>r, 


An important point is that the first term in that expression is potentially much larger
than the other ones. In order for this term to be balanced, one requires that $\eta q \sim sa$. Since
$\eta \ll a$, this condition is equivalent to $q \gg s$. Hence this work is only valid for vortex patches
with vorticity much larger than the local Keplerian vorticity. We also see that the second
term in that expression is of order $\eta/a$ compared to the other ones, and will be neglected
in the coming analysis. As a result, if we take $\int_0^{2\pi}(28)\,e^{-im\varphi}\,d\varphi/2\pi$, we get

$$\psi_\eta = \frac{q a^2}{2}\ln a - \frac{s a^2}{4} \quad\text{and}\quad \eta_{\pm 2} = -\frac{s a}{2q}, \qquad (29)$$

and all other $\eta_n$ are zero. We see again that the condition $\eta \ll a$ is equivalent to the condition
$s \ll q$. We therefore have

$$\eta(\varphi) = -\frac{s a}{q}\cos(2\varphi). \qquad (30)$$

To first order in $\eta$, this corresponds to an elliptical shape¹.
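The claim that a $\cos(2\varphi)$ deformation is an ellipse to first order can be illustrated numerically. The Python sketch below takes the boundary as $r = a - (sa/q)\cos 2\varphi$ (the reading of equation (30) assumed here) and compares it with the exact polar radius of an ellipse with semi-axes $a(1 \pm \epsilon)$, $\epsilon = -s/q$. The mismatch should scale like $\epsilon^2$, and the enclosed area should equal $\pi a^2$ up to the same order. The parameter values and function names are illustrative.

```python
import math


def patch_radius(phi, a, s, q):
    """Boundary r = a + eta(phi), with eta = -(s a / q) cos(2 phi)."""
    return a - (s * a / q) * math.cos(2.0 * phi)


def ellipse_radius(phi, a, eps):
    """Exact polar radius of an ellipse with semi-axes a(1+eps) and a(1-eps)."""
    major, minor = a * (1.0 + eps), a * (1.0 - eps)
    return major * minor / math.hypot(minor * math.cos(phi), major * math.sin(phi))


def max_mismatch(a, s, q, n=360):
    """Largest difference between the patch boundary and the matching ellipse."""
    eps = -s / q
    angles = [2.0 * math.pi * k / n for k in range(n)]
    return max(abs(patch_radius(p, a, s, q) - ellipse_radius(p, a, eps)) for p in angles)


def patch_area(a, s, q, n=3600):
    """Area enclosed by the boundary: integral of r^2 / 2 over phi (midpoint rule)."""
    dphi = 2.0 * math.pi / n
    return sum(0.5 * patch_radius((k + 0.5) * dphi, a, s, q) ** 2 for k in range(n)) * dphi


if __name__ == "__main__":
    print(max_mismatch(0.1, 0.01, 1.0), max_mismatch(0.1, 0.02, 1.0))
    print(patch_area(0.1, 0.01, 1.0) / (math.pi * 0.1 ** 2))
```

Doubling $s/q$ roughly quadruples the mismatch, which is the expected second-order behaviour.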


2.3.2  The  Keplerian  shear  case 

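As a first approximation, let's take $R = 1 + r\cos\varphi$. This approximation will be discussed
later. In this case, if we define

$$\psi_K(a) = \psi_K(1 + a\cos\varphi) = \sum_n I_n^K e^{in\varphi}, \qquad (31)$$

we have, from the matching condition given by equation (26),

$$\psi_\eta = I_0^K + \frac{q a^2}{2}\ln a, \qquad (32)$$

$$\eta_n = -\frac{2 I_n^K}{q a\left(1 - 1/|n|\right)} \quad\text{for } |n| > 1. \qquad (33)$$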

Because of the symmetry in $\varphi \to -\varphi$ of the Keplerian shear flow, we know that $I_n^K = I_{-n}^K$,
which is confirmed by expression (33). The case $n = 1$ corresponds to a translation of the
vortex along the $\theta$-direction (azimuthally around the central mass), so that $\eta_1$ can always
be taken to be 0 by an appropriate change of reference frame². The $I_n^K$ are given by

$$I_n^K = \int_0^{2\pi}\psi_K(1 + a\cos\varphi)\,e^{-in\varphi}\,\frac{d\varphi}{2\pi} = 2\pi\int_0^{2\pi}\left[2\left(\sqrt{1 + a\cos\varphi} - 1\right) - \frac{1}{2}\left(a^2\cos^2\varphi + 2a\cos\varphi\right)\right]e^{-in\varphi}\,\frac{d\varphi}{2\pi}. \qquad (34)$$

In order to solve this integral, we need to expand it as a Taylor series (which is necessarily
convergent since we have $a < 1$). So


/K  =  _na2  I*  cos2 6e-inV&P 

L  27 r 


r n  /9b. 


(35) 


where we define $(\nu)_k = \nu(\nu - 1)(\ldots)(\nu - k + 1)$, and $(\nu)_0 = 1$. Then

%  =  -Wh,a  +  477  f;  (36) 

k—\n\ 

indeed,  the  equation  for  an  ellipse  being  r  =  b^J  1  -  a  cos 2ip/e  \  if  the  eccentricity  e  is  very  small,  then 
r  ^  6(1  +  e  cos29/2a)  =  ro  +  cos(2<£>). 

² Indeed, let's take the example of the displacement of a circular patch: the equation for a circle centered
on $x = \eta$ (instead of $x = 0$) is $(x - \eta)^2 + y^2 = a^2$. Expanding this to first order in $\eta$ and changing coordinates
from $(x, y)$ to $(r, \varphi)$, we get $r = a + \eta\cos\varphi$, which corresponds to an $n = 1$ deformation mode.



Figure 2: On the left: steady-state shape of the vortex patch, for $a = 0.05$, $0.1$ and $0.2$ in the
Keplerian shear case, with the approximation $R = 1 + r\cos\varphi$. In all 3 plots $q = 1$. On the right:
ratio of the 3rd and 4th order of deformation to the 2nd as a function of $a$.

where the $C_k^p$ are the binomial coefficients. Finally, we compute the deformation by adding
the Fourier coefficients as

$$\eta(\varphi) = \sum_{n>0} 2\eta_n\cos(n\varphi). \qquad (37)$$

Fig. 2 presents the results for some values of $a$ and $q$. One can however guess (and check)
that:

•  The larger $q$, the smaller the deformation from a circular patch. Since $q$ only appears
in the “normalization” of $\eta$ rather than in the relative amplitude of the modes $\eta_n$ of
deformation, changing $q$ only amounts to changing the total amplitude of the deformation.
In the following plots, a small value of $q$ was chosen on purpose to let the vortex
deformation be more easily identifiable. In reality, we should take $q \gg 1$ to have the
required $\eta \ll a$.

•  On the other hand, the value of $a$ will influence the relative importance of the $\eta_n$, and
will dictate the shape of the vortex. The larger $a$, the larger the higher order modes of
deformation, and the more difficult the convergence.

We can see in Fig. 2 that for $a$ ($a/R_0$ in real units) small, the dominant term in the
deformation $\eta$ is $\eta_2$, the elliptical term. The ratio of the 3rd and 4th order deformation
modes to the second is also shown. As we can see from this plot, for $a/R_0 < 0.1$, the shape
of the vortex patch is very well approximated by an ellipse.

2.3.3 Validity of the approximation $R = 1 + r\cos\varphi$.

The full expression for $R$ is

$$R = \sqrt{1 + r^2 + 2r\cos\varphi} \approx 1 + r\cos\varphi + O(r^2), \qquad (38)$$
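The size of the error made by truncating (38) can be gauged with a short Python sketch. It converts vortex-centered polar coordinates $(r, \varphi)$ to central coordinates $(R, \theta)$, assuming the vortex sits at $R = 1$, $\theta = 0$ with $\varphi = 0$ aligned with $\theta = 0$ as stated in the setup, and reports the largest difference between the exact $R$ and $1 + r\cos\varphi$ around a circle of radius $r$. Function names are illustrative.

```python
import math


def to_central(r, phi):
    """Map (r, phi) about the vortex centre (R = 1, theta = 0) to (R, theta)."""
    x = 1.0 + r * math.cos(phi)
    y = r * math.sin(phi)
    return math.hypot(x, y), math.atan2(y, x)


def approx_R(r, phi):
    """First-order approximation R = 1 + r cos(phi)."""
    return 1.0 + r * math.cos(phi)


def worst_error(r, n=360):
    """Largest |R_exact - R_approx| over n equally spaced angles."""
    angles = [2.0 * math.pi * k / n for k in range(n)]
    return max(abs(to_central(r, p)[0] - approx_R(r, p)) for p in angles)


if __name__ == "__main__":
    for radius in (0.05, 0.1, 0.2):
        print(radius, worst_error(radius))
```

The worst-case error is close to $r^2/2$, so it quadruples each time the patch radius doubles.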


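therefore the approximation is only valid for $r \ll 1$. Let's study again the example of the
linear shear case: this time, we have

$$\psi_L(R) = \frac{s}{2}(R - 1)^2 = \frac{s}{2}\left(2 + r^2 + 2r\cos\varphi - 2\sqrt{1 + r^2 + 2r\cos\varphi}\right). \qquad (39)$$

Following the same procedure as before (Taylor expansion + Fourier decomposition) we can
obtain the Fourier coefficients of $\psi_L(a)$ successively

$$\psi_L(a) = -s\sum_{k>1}\frac{(1/2)_k}{k!}\left(a^2 + 2a\cos\varphi\right)^k = -s\sum_{k>1}\frac{(1/2)_k}{k!}\sum_{p=0}^{k} C_k^p\,a^{k+p}\,2^{k-p}\cos^{k-p}\varphi, \qquad (40)$$

so that

$$I_n^L = -s\sum_{k>1}\frac{(1/2)_k}{k!}\sum_{p=0}^{k} 2^{k-p}\,C_k^p\,a^{k+p}\,\frac{1}{2^{k-p}}\,C_{k-p}^{(k-p-n)/2}, \qquad (41)$$

where $C_{k-p}^{(k-p-n)/2}$ is taken to be zero unless $(k-p-n)/2$ is an integer between 0 and $k-p$.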
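The Fourier decomposition step relies on the coefficient of $e^{in\varphi}$ in $\cos^m\varphi$, which has the closed form $2^{-m}\binom{m}{(m-n)/2}$ when $m - n$ is even and $|n| \le m$, and is zero otherwise. The Python sketch below checks this identity against a direct numerical evaluation of the integral; the function names and the number of quadrature points are illustrative.

```python
import cmath
import math


def fourier_coeff_numeric(m, n, samples=512):
    """(1 / 2 pi) * integral over [0, 2 pi) of cos(phi)^m exp(-i n phi) d phi.

    The rectangle rule is exact (up to rounding) for trigonometric
    polynomials of degree below the number of samples.
    """
    total = 0.0 + 0.0j
    for k in range(samples):
        phi = 2.0 * math.pi * k / samples
        total += math.cos(phi) ** m * cmath.exp(-1j * n * phi)
    return total / samples


def fourier_coeff_closed(m, n):
    """Closed form 2^-m C(m, (m - n)/2); zero unless m - n is even and |n| <= m."""
    if abs(n) > m or (m - n) % 2 != 0:
        return 0.0
    return math.comb(m, (m - n) // 2) / 2.0 ** m


if __name__ == "__main__":
    for m in range(5):
        print(m, [round(fourier_coeff_closed(m, n), 4) for n in range(-m, m + 1)])
```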


The $\eta_n$ are given by equation (33). The resulting steady-state shape of the vortex patch is
shown in Fig. 3. This deformation is due to the fact that the flow is not a plane parallel flow,
but rather curves around the central accreting object. When the size of the patch is large,
this curvature acts to deform it.

To conclude, if we extrapolate the linear shear results to the Keplerian case, it is likely
that the approximation will fail for $a > 0.1$. The full expression for $R$ should therefore be
kept. The results for the Keplerian shear, using equation (38) in expression (35), are the
following:

k 

I f  =  -47 r  2  £  ak+p2k-pCkCk-ln  (42) 

fc>l  '  p=0  2 

The corresponding vortex patches are presented in Fig. 4, for both a cyclonic and an
anticyclonic vortex.


2.4  Discussion. 

Assuming that the background vorticity is constant, it has been possible to calculate the
steady state shape of top-hat (i.e. piecewise constant) vortices. In a linear shear, it is a well
known result that the shape of the vortices should be elliptical [4]. For very small vortices,
for which the variation of the background vorticity is negligible, we could expect, and saw,
that the vortices were mainly elliptical in shape. However, for larger vortices³, there is a
systematic departure from the elliptical shape, and this has two main causes: firstly, the
curvature of the Keplerian flow around the central star, and secondly (and this is the dominant
effect), the background variation in the velocity field. The next step in this analysis would be to
consider the stability of these vortices, in a similar way as has been done by Meacham et
al. [5]. This is not in the scope of this project.

Figure 3: On the left: steady-state shape of the vortex patch, for $a = 0.1$, $0.2$ and $0.4$ in the linear
shear case. In all 3 plots $q = 1$. On the right: relative amplitude of the 3rd and 4th moment of
deformation compared to the second, for varying values of $a$.

3  Including  the  main  flow  vorticity  gradient 

In  the  previous  section,  the  vorticity  gradient  term  in  the  vorticity  equation  (4)  has  been 
neglected  in  order  to  allow  solutions  with  piece-wise  constant  vorticity.  However,  including 
the  vorticity  gradient  term  forbids  this  solution.  In  particular,  when  the  nonlinear  terms  can 
be  neglected,  we  will  see  that  there  exist  stationary  wave-like  solutions:  the  lee  waves.  These 
have  to  be  taken  into  account  in  their  interaction  with  the  vortex  patch.  In  the  following 
work,  we  will  therefore  study  two  main  regimes: 

•  Far  from  the  vortex,  the  perturbation  induced  by  the  vortex  is  small;  the  vorticity 
equation  becomes  a  linear  equation  for  stationary  lee  waves.  The  far  field  of  the  vortex 
can  then  be  obtained  by  studying  the  lee  waves  which  are  created  around  a  point  vortex. 

•  Near the vortex, the perturbation is much stronger than the Keplerian flow. By rescaling
the coordinate system to emphasize the region near the vortex, we will see that
to a first approximation, the gradient of the vorticity can be neglected. This zeroth
order solution closely resembles that presented in Section 1 for the linear shear. The
deformation of the vortex patch is then given by the next order in the approximation.

3 In fact the approximation of constant vorticity around the vortex is no longer valid for the larger
vortices anyway.

Figure 4: Deformation of a vortex patch in a Keplerian shear for a = 0.05, 0.1, 0.2, and q = 1.

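In the steady state, equation (4) becomes

$$J(\psi_K + \psi',\ \omega_K + \omega') = 0 \qquad (43)$$

where the Jacobian J is, in the cylindrical coordinate system,

$$J(A,B) = \frac{1}{R}\frac{\partial A}{\partial R}\frac{\partial B}{\partial \theta} - \frac{1}{R}\frac{\partial A}{\partial \theta}\frac{\partial B}{\partial R} \qquad (44)$$

Let's now use the new coordinate system

$$\theta = -X, \qquad R = \exp(Y) \qquad (45)$$

The Keplerian velocity and stream function in this new coordinate system are then (using
equations (7), (8), (9) and $\Gamma = 3\pi$)

$$v_K = \frac{2\Gamma}{3}\left(e^{-Y/2} - e^{Y}\right) \qquad (46)$$

$$\psi_K = \frac{2\Gamma}{3}\left(2e^{Y/2} - \frac{1}{2}e^{2Y} - \frac{3}{2}\right) \qquad (47)$$

$$\omega_K = \frac{\Gamma}{3}e^{-3Y/2} - \frac{4\Gamma}{3} \qquad (48)$$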
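The three Keplerian expressions (46)-(48) can be cross-checked numerically. The sketch below is only an illustration, assuming the conventions $v_K = d\psi_K/dR$ and $\omega_K = \nabla^2\psi_K$ with $R = e^{Y}$, so that $d/dR = e^{-Y} d/dY$ and the axisymmetric Laplacian is $e^{-2Y} d^2/dY^2$. The value of `GAMMA`, the step size and the sample points are arbitrary choices, not taken from the text.

```python
import math

GAMMA = 1.0  # shear parameter; arbitrary value, the check is linear in it


def psi_K(Y):
    """Keplerian stream function, equation (47)."""
    return (2 * GAMMA / 3) * (2 * math.exp(Y / 2) - 0.5 * math.exp(2 * Y) - 1.5)


def v_K(Y):
    """Keplerian velocity, equation (46)."""
    return (2 * GAMMA / 3) * (math.exp(-Y / 2) - math.exp(Y))


def omega_K(Y):
    """Keplerian vorticity, equation (48)."""
    return (GAMMA / 3) * math.exp(-1.5 * Y) - 4 * GAMMA / 3


def d1(f, Y, h=1e-3):
    """Central finite-difference first derivative."""
    return (f(Y + h) - f(Y - h)) / (2 * h)


def d2(f, Y, h=1e-3):
    """Central finite-difference second derivative."""
    return (f(Y + h) - 2 * f(Y) + f(Y - h)) / h ** 2


def max_residuals(points=(-2.0, -0.5, 0.0, 0.7, 1.5)):
    """Largest mismatch in v_K = e^{-Y} psi_K' and omega_K = e^{-2Y} psi_K''."""
    r_v = max(abs(math.exp(-Y) * d1(psi_K, Y) - v_K(Y)) for Y in points)
    r_w = max(abs(math.exp(-2 * Y) * d2(psi_K, Y) - omega_K(Y)) for Y in points)
    return r_v, r_w
```

Both residuals returned by `max_residuals()` should be at the level of the finite-difference error, and `psi_K(0.0)` vanishes, which is the property used in the far-field analysis.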

The Jacobian equation then becomes (dropping the primes on the perturbed quantities)

$$J\left(\frac{2\Gamma}{3}\left(2e^{Y/2} - \frac{1}{2}e^{2Y} - \frac{3}{2}\right) + \psi,\ \ \frac{\Gamma}{3}e^{-3Y/2} - \frac{4\Gamma}{3} + e^{-2Y}\nabla^2\psi\right) = 0 \qquad (49)$$

where we now have $J(A,B) = \partial_X A\,\partial_Y B - \partial_Y A\,\partial_X B$, and the Laplacian operator is
$\nabla^2 = \partial_{XX} + \partial_{YY}$.

3.1  The far field: a linear approach

3.1.1  Assumptions  and  equations 

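We will assume that the perturbation is much smaller than the Keplerian stream function.
This should be valid everywhere but around Y = 0, where $\psi_K$ vanishes. Equation (49) implies
that

$$\frac{\Gamma}{3}e^{-3Y/2} - \frac{4\Gamma}{3} + e^{-2Y}\nabla^2\psi = F\left(\frac{2\Gamma}{3}\left(2e^{Y/2} - \frac{1}{2}e^{2Y} - \frac{3}{2}\right) + \psi\right) \qquad (50)$$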

The  function  F  is  not  unique.  This  means  that  there  are  many  steady  state  solutions  to  the 
problem  we  are  considering,  each  of  them  depending  on  the  type  of  forcing,  the  symmetries 
required,  the  behaviour  far  from  the  vortex.  We  must  choose  the  function  F  carefully  to 
represent  the  physics  of  the  system  considered.  We  want  to  represent  the  presence  of  a  small 
vortex  patch,  and  its  influence  on  the  Keplerian  accretion  flow.  Far  from  the  vortex  (the 
region  considered  here),  we  hope  that  there  exist  solutions  in  which  the  disturbance  caused 
by  the  vortex  is  very  small,  so  that  the  Keplerian  stream  lines  are  merely  displaced  by  a  small 
amount.  Taking  these  two  ideas  in  consideration,  we  see  that  a  possible  prescription  for  F  is 

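$$F(\psi_K + \psi) = F_K(\psi_K + \psi) + Q\,\delta(X)\delta(Y) \qquad (51)$$

where Q is the total vorticity of the patch, and the function $F_K$ is defined as

$$\omega_K = F_K(\psi_K) \ \Leftrightarrow\ \frac{\Gamma}{3}e^{-3Y/2} - \frac{4\Gamma}{3} = F_K\left(\frac{2\Gamma}{3}\left(2e^{Y/2} - \frac{1}{2}e^{2Y} - \frac{3}{2}\right)\right) \qquad (52)$$

Putting this ansatz back into equation (50) we get

$$e^{-2Y}\nabla^2\psi = F_K(\psi_K + \psi) - F_K(\psi_K) + Q\,\delta(X)\delta(Y) \approx \psi\,F_K'(\psi_K) + Q\,\delta(X)\delta(Y) \qquad (53)$$

since we assumed that $\psi \ll \psi_K$. The function $F_K'$ can easily be obtained by taking the
Y-derivative of equation (52), and is

$$F_K'(\psi_K) = \frac{d\omega_K}{dY}\left(\frac{d\psi_K}{dY}\right)^{-1} = \frac{3}{4}\,\frac{e^{-2Y}}{e^{3Y/2} - 1} \qquad (54)$$

We finally get

$$\left(e^{3Y/2} - 1\right)\nabla^2\psi = \frac{3}{4}\psi + Q\,\delta(X)\delta(Y)\left(e^{7Y/2} - e^{2Y}\right) \qquad (55)$$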
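Away from Y = 0, equation (55) can be read as $\nabla^2\psi = c(Y)\,\psi$ with $c(Y) = \frac{3}{4}/(e^{3Y/2} - 1)$. The short sketch below evaluates this coefficient and compares it with its limiting forms in three regions: $1/(2Y)$ near the critical layer, $\frac{3}{4}e^{-3Y/2}$ for Y ≫ 1, and $-\frac{3}{4}$ for Y ≪ -1. The function names are illustrative only and do not come from the original text.

```python
import math


def lee_wave_coefficient(Y):
    """c(Y) in  laplacian(psi) = c(Y) * psi, from (55) divided by (e^{3Y/2} - 1)."""
    return 0.75 / math.expm1(1.5 * Y)


def near_critical_layer(Y):
    """Leading behaviour as Y -> 0: the coefficient is singular like 1/(2Y)."""
    return 1.0 / (2.0 * Y)


def far_outside(Y):
    """Leading behaviour for Y >> 1: exponentially small coefficient."""
    return 0.75 * math.exp(-1.5 * Y)


def far_inside(Y):
    """Leading behaviour for Y << -1: constant coefficient -3/4."""
    return -0.75
```

For example, `lee_wave_coefficient(1e-3)` is within a fraction of a percent of `near_critical_layer(1e-3)`, which makes the 1/Y singularity at the critical layer explicit.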

Note that equation (53) is an equation for stationary waves, the lee waves.

3.1.2  Localized solutions to the point vortex problem

The periodicity in X suggests the expansion $\psi = \sum_m \psi_m(Y)\,e^{imX}$. The symmetry of the system
as $X \to -X$ limits the sum to $m \ge 0$. In this case, we will have to solve

$$\left(e^{3Y/2} - 1\right)\left(\frac{d^2\psi_m}{dY^2} - m^2\psi_m\right) = \frac{3}{4}\psi_m + \frac{Q}{2\pi}\,\delta(Y)\left(e^{7Y/2} - e^{2Y}\right) \qquad (56)$$

Asymptotically, we see that there exist solutions to this equation which are localized in the
radial direction:

•  for $Y \gg 1$, we get

$$e^{3Y/2}\left(\frac{d^2\psi_m}{dY^2} - m^2\psi_m\right) = \frac{3}{4}\psi_m \qquad (57)$$

This equation can be solved exactly by using the change of variable $t = \frac{2}{\sqrt{3}}e^{-3Y/4}$, which
leads to the solutions

$$\psi_m \propto I_{\pm 4m/3}(t) \qquad (58)$$

For $Y \gg 1$, $t \to 0$ so we must keep the $I_{+4m/3}$ solution to ensure the decay of the
solutions. Note that when m = 0, there is no decaying solution. This will be discussed
later.

•  for $Y \ll -1$, which corresponds to the center of the accretion disk, the equation becomes

$$\frac{d^2\psi_m}{dY^2} - m^2\psi_m = -\frac{3}{4}\psi_m \qquad (59)$$

which has the decaying solutions

$$\psi_m \propto \exp\left(\sqrt{m^2 - \tfrac{3}{4}}\ Y\right) \qquad (60)$$

when $m \neq 0$. When m = 0, we get oscillatory solutions.

Near the point vortex (X, Y small) the equation becomes

$$\frac{3Y}{2}\left(\frac{d^2\psi_m}{dY^2} - m^2\psi_m\right) = \frac{3}{4}\psi_m + \frac{Q}{2\pi}\,\frac{3Y}{2}\,\delta(Y) \qquad (61)$$

By changing the variable to $t = \alpha Y$, we get, for the homogeneous part

$$\frac{d^2\psi_m}{dt^2} - \left(\frac{m^2}{\alpha^2} + \frac{1}{2\alpha t}\right)\psi_m = 0 \qquad (62)$$

which corresponds to a Whittaker equation with coefficients $\kappa = -1/2\alpha$ and $\mu^2 = 1/4$ (cf.
Abramowitz & Stegun [6]) provided $m^2/\alpha^2 = 1/4$. The solutions are the Whittaker functions
$M_{\kappa,\mu}$ and $W_{\kappa,\mu}$ such that

$$M_{\kappa,\mu}(t) = e^{-t/2}\,t^{1/2+\mu}\,M\left(\tfrac{1}{2} + \mu - \kappa,\ 1 + 2\mu,\ t\right)$$

$$W_{\kappa,\mu}(t) = e^{-t/2}\,t^{1/2+\mu}\,U\left(\tfrac{1}{2} + \mu - \kappa,\ 1 + 2\mu,\ t\right) \qquad (63)$$

where M and U are the regular and singular confluent hypergeometric functions. Since we are
actually solving the problem of stationary waves in a shear flow, we know that the point at
which the velocity vanishes is a critical layer for the waves. The singularity of the equations
near Y = 0 reflects the presence of this critical layer. As a result we expect solutions of the
kind [7]

$$\psi_r \propto t\,P_1(t)$$

$$\psi_s \propto t\,\ln|t|\,P_1(t) + P_2(t) \qquad (64)$$

with $P_1(t) = a_0 + a_1 t + \cdots$ and $P_2(t) = b_0 + b_1 t + b_2 t^2 + \cdots$ near the origin, we know that
we should take $\mu = 1/2$ [6]. The expansion of the functions near the origin is then

$$M_{\kappa,1/2}(t) = e^{-t/2}\,t\,M(1 - \kappa, 2, t) = e^{-t/2}\,t \sum_{n=0}^{\infty} \frac{(1-\kappa)_n}{(2)_n\,n!}\,t^n \qquad (65)$$

$$W_{\kappa,1/2}(t) = e^{-t/2}\,t\,U(1 - \kappa, 2, t) = e^{-t/2}\left\{\frac{t}{\Gamma(-\kappa)}\left[M(1-\kappa, 2, t)\ln t + \sum_{n=0}^{\infty} \frac{(1-\kappa)_n}{(2)_n\,n!}\,t^n\left(\Psi(1 - \kappa + n) - \Psi(1 + n) - \Psi(2 + n)\right)\right] + \frac{1}{\Gamma(1-\kappa)}\right\} \qquad (66)$$

where $\Psi(a) = \Gamma'(a)/\Gamma(a)$, and $\Gamma(\cdot)$ with an argument denotes the gamma function. We see that
in order for $W_{\kappa,1/2}$ to be well defined, we need to choose t positive everywhere, which means
taking $\alpha_+ = 2m$ for Y > 0 and $\alpha_- = -2m$ for Y < 0. This also means that the Y > 0 and
Y < 0 branches will have different values of $\kappa$: $\kappa_+ = -1/4m$ and $\kappa_- = 1/4m$.

Let's now write the full solutions:

$$\psi_m(Y > 0) = A\,M_{\kappa_+,1/2}(2mY) + B\,W_{\kappa_+,1/2}(2mY) \qquad (67)$$

$$\psi_m(Y < 0) = C\,M_{\kappa_-,1/2}(-2mY) + D\,W_{\kappa_-,1/2}(-2mY) \qquad (68)$$

Since $M_{\kappa,1/2}(0) = 0$ and $W_{\kappa,1/2}(0) = 1/\Gamma(1-\kappa)$, the continuity of $\psi_m$ across the origin implies

$$\frac{B}{\Gamma(1 + 1/4m)} = \frac{D}{\Gamma(1 - 1/4m)} \qquad (69)$$

Integrating equation (56) across Y = 0, we get the jump condition

$$\left[\frac{d\psi_m}{dY}\right]_{0_-}^{0_+} = \int_{0_-}^{0_+} \frac{\psi_m}{2Y}\,dY + \frac{Q}{2\pi} \qquad (70)$$

The integral term on the RHS is mathematically ill-defined. However, if we assume that it
should really be the principal value of this integral, then we can show that this term is 0
using the expansion in Y of $\psi$ near Y = 0, and we are left with the simple jump condition

$$\left[\frac{d\psi_m}{dY}\right]_{0_-}^{0_+} = \frac{Q}{2\pi} \qquad (71)$$

Since we have $M'_{\kappa,1/2}(0) = 1$ and $W'_{\kappa,1/2}(t \to 0) = \ln t/\Gamma(-\kappa) + c(\kappa)$, where $c(\kappa)$ is a constant
term, we see that the singular part of the derivative is continuous across the origin when the
function $\psi_m$ itself is: indeed, the derivative is

$$\psi_m'(0_+) = 2m\left(A + B\,\frac{\ln t}{\Gamma(-\kappa_+)} + B\,c(\kappa_+)\right) \qquad (72)$$

$$\psi_m'(0_-) = -2m\left(C + D\,\frac{\ln t}{\Gamma(-\kappa_-)} + D\,c(\kappa_-)\right) \qquad (73)$$

so that the continuity of the singular part of the derivative implies

$$\frac{B}{\Gamma(1/4m)} = -\frac{D}{\Gamma(-1/4m)} \ \Leftrightarrow\ \frac{B}{\Gamma(1 + 1/4m)} = \frac{D}{\Gamma(1 - 1/4m)} \qquad (74)$$

using the property $\Gamma(1 + x) = x\,\Gamma(x)$.

This comment implies that although the derivative of the function $\psi_m$ becomes singular
near the origin, it is still possible to have a finite jump of the derivatives across the origin.
The asymptotic behaviour and the jump condition define uniquely the four coefficients A, B,
C and D for each value of m but 0, to yield a unique solution for the far field depending only
on the vortex strength Q.


3.1.3  The axisymmetric (m = 0) case

In this case, the solutions do not decay at infinity. In fact, we see that for $Y \gg 1$, the equation
becomes $d^2\psi_0/dY^2 = 0$, which has the general solutions $\psi_0 = aY + b$, and for $Y \ll -1$ there is an
oscillatory solution $\psi_0 = c\cos(\sqrt{3}\,Y/2) + d\sin(\sqrt{3}\,Y/2)$. There is here an arbitrariness in the
choice of the boundary conditions, which is solved by the matching with the inner solution.
For the purpose of plotting the results only, we chose to take the following boundary conditions:
$\psi_0(Y_c) = \psi_0(-Y_c) = 0$. Again, there exists a unique solution fulfilling these 2 boundary
conditions and the jump condition at the origin.


3.1.4  Numerical  procedure  and  results 

Having established that there exists a unique solution to the problem, it is now easy to find
it numerically. We start by integrating equation (56) from $+\infty$ and $-\infty$ towards the origin
using the asymptotic behaviour as a first boundary condition. We define a free parameter h
as $\psi_m(0) = h$, and use this as a second boundary condition for both branches of the solution.
We then calibrate this parameter h so that the jump across the origin is indeed $Q/2\pi$. Since
both parts of the solution are linear, increasing h by a factor of 2 amounts to increasing Q
by a factor of 2: there is a linear relation between h and Q, namely

$$\frac{Q}{2\pi} = s(m)\,h \qquad (75)$$

The coefficients s(m) can be found numerically by fixing h = 1. The result is shown in Fig. 5,
on the left. On the right, the resulting solutions for Q = -50 are shown. Note the slow
convergence of the modes for large m. This is due to the fact that the point vortex is a
logarithmic singularity, and the amplitude of the modes varies as 1/m.
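The calibration described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the code used for Fig. 5: it integrates $\psi_m'' = \left(m^2 + \frac{3}{4}/(e^{3Y/2} - 1)\right)\psi_m$ with a fixed-step RK4 scheme, starts at a finite distance `L` instead of infinity, stops at a small distance `delta` from the critical layer instead of Y = 0, and takes s(m) to be the derivative jump per unit h, so that $Q = 2\pi\,s(m)\,h$. The names `L`, `delta` and `n` are hypothetical parameters, and the sketch only covers $m \ge 1$.

```python
import math


def coeff(Y, m):
    """Coefficient q(Y) in psi'' = q(Y) psi, valid away from Y = 0."""
    return m * m + 0.75 / math.expm1(1.5 * Y)


def integrate(m, Y0, Y1, psi, dpsi, n=4000):
    """Fixed-step RK4 for psi'' = coeff(Y, m) psi from Y0 to Y1; returns (psi, psi')."""
    h = (Y1 - Y0) / n
    Y = Y0
    for _ in range(n):
        k1p, k1d = dpsi, coeff(Y, m) * psi
        k2p = dpsi + 0.5 * h * k1d
        k2d = coeff(Y + 0.5 * h, m) * (psi + 0.5 * h * k1p)
        k3p = dpsi + 0.5 * h * k2d
        k3d = coeff(Y + 0.5 * h, m) * (psi + 0.5 * h * k2p)
        k4p = dpsi + h * k3d
        k4d = coeff(Y + h, m) * (psi + h * k3p)
        psi += h * (k1p + 2 * k2p + 2 * k3p + k4p) / 6
        dpsi += h * (k1d + 2 * k2d + 2 * k3d + k4d) / 6
        Y += h
    return psi, dpsi


def jump_per_unit_h(m, L=8.0, delta=1e-2):
    """s(m): derivative jump between -delta and +delta with both branches scaled to h = 1."""
    # Outer branch: start from the decaying behaviour exp(-m Y) at Y = L.
    p, d = integrate(m, L, delta, 1.0, -float(m))
    right = d / p
    # Inner branch: start from the decaying behaviour exp(sqrt(m^2 - 3/4) Y) at Y = -L.
    k = math.sqrt(m * m - 0.75)
    p, d = integrate(m, -L, -delta, 1.0, k)
    left = d / p
    # The logarithmic parts of the two derivatives cancel in the difference.
    return right - left


def vortex_strength(m, h):
    """Q corresponding to an amplitude h of mode m (assumed form of relation (75))."""
    return 2 * math.pi * jump_per_unit_h(m) * h
```

Because both branches are rescaled to the same value at the matching points, doubling `h` doubles the returned `Q`, which is the linearity the text relies on. Reducing `delta` and increasing `n` together is the natural convergence check.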


Figure 5: On the left, the coefficients s(m) have been calculated and are represented as a function
of m. On the right, this calibration has been used to calculate the functions $\psi_m$ for m = 0, 1, 2, 3, 4, 5
for Q = -50. Only the first few have been labelled. The cutoff for $\psi_0$ has been chosen at $Y_c = 10$.

Finally, we are left to sum the Fourier coefficients to reconstruct the function: we have

$$\psi(X, Y) = \psi_0(Y) + 2\sum_{m=1}^{\infty} \psi_m(Y)\cos(mX) \qquad (76)$$

The contour lines of the total stream function (the perturbation and the Keplerian shear)
have been plotted for 4 values of Q, and are represented in Fig. 6. In all cases, the summation
over m has been truncated at m = 20.
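The effect of truncating a cosine series whose amplitudes fall off like 1/m can be illustrated with a model series, not with the actual $\psi_m$. The sketch below uses the classical identity $\sum_{m \ge 1} \cos(mX)/m = -\ln(2\sin(X/2))$ for $0 < X < 2\pi$, which has the same logarithmic singularity at X = 0 as a point vortex. The function names are illustrative.

```python
import math


def partial_sum(X, M):
    """Truncated model series: sum of cos(m X)/m for m = 1..M."""
    return sum(math.cos(m * X) / m for m in range(1, M + 1))


def exact_log(X):
    """Closed form of the full series for 0 < X < 2*pi."""
    return -math.log(2 * math.sin(X / 2))


def truncation_error(X, M):
    """Absolute error made by keeping only the first M modes."""
    return abs(partial_sum(X, M) - exact_log(X))
```

At fixed X the error only decreases like 1/M, and it grows as X approaches the singularity, so a truncation at m = 20 reproduces the field well away from the vortex but not in its immediate neighbourhood.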

3.1.5  Discussion 

The solutions obtained correspond well to what might have been expected. There are
two main features to the result. Firstly, the presence of a point vortex in any shear flow
induces the deformation of stream lines seen in Fig. 7. This type of deformation is also seen
in the results presented here. The second feature corresponds to the presence of the critical
layer at the radius R = 1, and is characterized by the discontinuity in the velocities at that
radius. This is qualitatively similar to the case of the Cat's Eyes patterns seen in plane
parallel shear flows [7]. The linear approximation theoretically fails as $Y \to 0$, and a full
non-linear theory would normally be necessary; however, it was shown that the nonlinear
boundary layer simply connects to the linear branches of the solution far from the critical
layer, without change in the phase of the logarithm, so that the solution found here is a good
approximation to the nonlinear solution provided $Y \gg \epsilon$.

Figure 6: Stream line contours around an anticyclonic point vortex in Keplerian shear flow, Q = 1
(upper left), Q = 10 (upper right), Q = -1 (lower left) and Q = -10 (lower right).

3.2  Close  to  the  vortex 

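In this case, we want to choose a new scaling to represent the region near the vortex. Let's
choose to take $Y = \epsilon y$ and $X = \epsilon x$, and expand the equations in $\epsilon$, assuming that $\epsilon \ll 1$. We
also assume the following form for the stream function $\psi$ and the vorticity:

$$\psi = \epsilon^2(\psi_0 + \epsilon\psi_1) \qquad (77)$$

$$\omega = \omega_0 + \epsilon\,\omega_1 \qquad (78)$$

Figure 7: Stream lines around a cyclonic (left) and an anticyclonic (right) vortex positioned at a
zero-velocity point.

The Jacobian equation becomes

$$J\left(-\frac{\Gamma}{2}y^2 - \frac{5\Gamma}{12}\epsilon y^3 + \psi_0 + \epsilon\psi_1 + O(\epsilon^2),\ \ \frac{\Gamma}{3}\left(1 - \frac{3}{2}\epsilon y\right) + (1 - 2\epsilon y)\nabla^2\psi_0 + \epsilon\nabla^2\psi_1 + O(\epsilon^2)\right) = 0 \qquad (79)$$

and we also have

$$\omega_0 = \nabla^2\psi_0 \qquad (80)$$

$$\omega_1 = \nabla^2\psi_1 - 2y\,\nabla^2\psi_0 \qquad (81)$$

The successive orders in $\epsilon$ from the Jacobian yield

$$J\left(\psi_0 - \frac{\Gamma}{2}y^2,\ \nabla^2\psi_0\right) = 0 \qquad (82)$$

$$J\left(\psi_0 - \frac{\Gamma}{2}y^2,\ -\frac{\Gamma}{2}y + \omega_1\right) + J\left(\psi_1 - \frac{5\Gamma}{12}y^3,\ \nabla^2\psi_0\right) = 0 \qquad (83)$$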
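The coefficients appearing in this expansion follow from Taylor-expanding the Keplerian stream function and vorticity of Section 3 about Y = 0 with $Y = \epsilon y$. The worked step below is a sketch of that expansion, assuming $\psi_K = \frac{2\Gamma}{3}(2e^{Y/2} - \frac{1}{2}e^{2Y} - \frac{3}{2})$ and $\omega_K = \frac{\Gamma}{3}e^{-3Y/2} - \frac{4\Gamma}{3}$; additive constants are irrelevant inside a Jacobian.

```latex
\begin{align*}
2e^{Y/2} &= 2 + Y + \tfrac{1}{4}Y^2 + \tfrac{1}{24}Y^3 + O(Y^4), \\
\tfrac{1}{2}e^{2Y} &= \tfrac{1}{2} + Y + Y^2 + \tfrac{2}{3}Y^3 + O(Y^4), \\
\psi_K(\epsilon y) &= \tfrac{2\Gamma}{3}\left(-\tfrac{3}{4}\epsilon^2 y^2 - \tfrac{5}{8}\epsilon^3 y^3\right) + O(\epsilon^4)
   = -\tfrac{\Gamma}{2}\epsilon^2 y^2 - \tfrac{5\Gamma}{12}\epsilon^3 y^3 + O(\epsilon^4), \\
\omega_K(\epsilon y) &= \tfrac{\Gamma}{3}\left(1 - \tfrac{3}{2}\epsilon y\right) - \tfrac{4\Gamma}{3} + O(\epsilon^2)
   = -\Gamma - \tfrac{\Gamma}{2}\epsilon y + O(\epsilon^2), \\
e^{-2\epsilon y}\,\nabla^2_{XY}\!\left[\epsilon^2(\psi_0 + \epsilon\psi_1)\right]
   &= (1 - 2\epsilon y)\,\nabla^2_{xy}\psi_0 + \epsilon\,\nabla^2_{xy}\psi_1 + O(\epsilon^2).
\end{align*}
```

Dividing the first argument of the Jacobian by $\epsilon^2$ and collecting powers of $\epsilon$ then gives the $-\frac{\Gamma}{2}y^2$ and $-\frac{5\Gamma}{12}y^3$ terms at zeroth and first order, and the $-\frac{\Gamma}{2}y$ term that accompanies $\omega_1$.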

3.2.1  Zeroth  order  solutions 

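The solutions to

$$J\left(\psi_0 - \frac{\Gamma}{2}y^2,\ \nabla^2\psi_0\right) = 0 \qquad (84)$$

are well known, and have been calculated in the previous section: taking piece-wise constant
solutions, we get an elliptical vortex patch of constant vorticity. This suggests the use of the
elliptical orthogonal coordinate system $(\chi, \zeta)$, such that

$$x = f\chi\zeta$$

$$y = f\sqrt{\chi^2 - 1}\,\sqrt{1 - \zeta^2} \qquad (85)$$

with $1 < \chi$ and $-1 < \zeta < 1$. The boundary of the vortex is given by $\chi = a$ so that the
solution can be written as

$$\omega_0 = q\,\mathcal{H}(a - \chi) \qquad (86)$$

where $\mathcal{H}$ is the Heaviside step function. The stream function $\psi_0$ is then given by equation (82).
The Laplacian in the elliptical coordinate system is

$$\nabla^2\psi_0 = \frac{1}{f^2(\chi^2 - \zeta^2)}\left[\sqrt{\chi^2 - 1}\,\frac{\partial}{\partial\chi}\left(\sqrt{\chi^2 - 1}\,\frac{\partial\psi_0}{\partial\chi}\right) + \sqrt{1 - \zeta^2}\,\frac{\partial}{\partial\zeta}\left(\sqrt{1 - \zeta^2}\,\frac{\partial\psi_0}{\partial\zeta}\right)\right] = q\,\mathcal{H}(a - \chi) \qquad (87)$$

Looking for separable solutions, such that $\psi_0(\chi, \zeta) = \sum_n G_n(\chi)H_n(\zeta)$, we must solve

$$\sqrt{\chi^2 - 1}\,\frac{d}{d\chi}\left(\sqrt{\chi^2 - 1}\,\frac{dG}{d\chi}\right) = \lambda^2 G + q f^2\chi^2 \qquad (88)$$

$$\sqrt{1 - \zeta^2}\,\frac{d}{d\zeta}\left(\sqrt{1 - \zeta^2}\,\frac{dH}{d\zeta}\right) = -\lambda^2 H - q f^2\zeta^2 \qquad (89)$$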


The homogeneous part generates the Chebyshev polynomials for both $\chi$ and $\zeta$ with $\lambda = n$.
The polynomials form the basis for the regular solution. The $H_n$ solutions must always be
regular, so that we simply have

$$H(\zeta) = \sum_{n=0}^{\infty} b_n T_n(\zeta) \qquad (90)$$

where the $T_n$ are the Chebyshev polynomials of the first kind. However, because of the
matching of $\psi_0$ across the vortex boundary we also need to find the singular solution for G
outside the vortex. In order to do this, let's perform the change of variables $\chi = \cosh\sigma$:

$$\frac{d^2 G_n}{d\sigma^2} = n^2 G_n \qquad (91)$$

We have the solutions, for n > 0,

$$G_n = a_n e^{n\sigma} + d_n e^{-n\sigma} = a_n\left(\chi + \sqrt{\chi^2 - 1}\right)^{n} + d_n\left(\chi + \sqrt{\chi^2 - 1}\right)^{-n} = a_n R_n + d_n S_n \qquad (92)$$

which defines the functions $R_n$ and $S_n$, and for n = 0

$$G_0 = a_0\ln\left(\chi + \sqrt{\chi^2 - 1}\right) + d_0 = a_0 S_0 + d_0 \qquad (93)$$

which defines $S_0$, and $R_0 = 1$. The special solution, necessary inside the vortex, is a second
order polynomial in the variables $\chi$ or $\zeta$ respectively, so that the final solution is (the
coefficients multiplying the quadratic terms are illegible in the source and are left as gaps)

$$\psi_0^{in} = \sum_n \left(A_n T_n(\chi) + (\chi^2 - 2)\,[\ldots]\right)\left(B_n T_n(\zeta) + (\zeta^2 - 2)\,[\ldots]\right) \qquad (94)$$

$$\psi_0^{out} = \sum_{n=0}^{\infty} C_n S_n(\chi)\,T_n(\zeta) \qquad (95)$$

where we have used the fact that the solutions must be regular inside the vortex, and that
they should decay outside the vortex. Note that we can rewrite $\chi^2 = (T_0(\chi) + T_2(\chi))/2 =
(1 + T_2(\chi))/2$. The matching condition at the boundary $\chi = a$ yields the following relation
between the coefficients (see footnote 4):


AnTn(a)  +  (a2  —  2)— —  ^  ^ Bn  +  ^  —  CnSn(a)  for  n  >  0 


A0  +  (a>-2)f)(B0-3^ 


)“ 


Co  So  (a)  for  n  =  0 


The  matching  of  the  derivatives  yields  a  similar  system, 


(AnT^a)  +  qf2(a-l))(Bn  +  ^-6n^  =  CnS'n(a)  for  n  >  0 
<?/2(a  -  1)  (s0  -  3^0  =  C0S'0{a)  for  n  =  0 

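Using equation (82) with the fact that $\partial\omega_0/\partial\zeta = 0$ yields

$$\left.\frac{\partial}{\partial\zeta}\left(\psi_0 - \frac{\Gamma}{2}y^2\right)\right|_{\chi = a} = 0 \qquad (98)$$

or rather,

$$\left.\left(\psi_0 - \frac{\Gamma}{2}y^2\right)\right|_{\chi = a} = \psi_0(a, \zeta) - \frac{\Gamma}{2}f^2(a^2 - 1)(1 - \zeta^2) = c \qquad (99)$$

where c is a constant. This implies that we must take

$$C_n S_n(a) = \frac{\Gamma}{2}f^2(a^2 - 1)\left(\delta_{n,0} - \frac{1}{2}\delta_{n,2}\right) \qquad (100)$$

so we see that only 2 coefficients are non-zero, namely $C_0$ and $C_2$.

If we were to match this with the Keplerian shear flow, and ignore the far field solution,
we would then obtain a unique relation between the size of the vortex a and its vorticity q.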


3.2.2  First  order  solutions 


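We now have to solve equation (83). The Jacobians in this equation can directly be
transformed into Jacobians for the new coordinate system: with

$$J(A, B) = \frac{\partial A}{\partial\chi}\frac{\partial B}{\partial\zeta} - \frac{\partial A}{\partial\zeta}\frac{\partial B}{\partial\chi} \qquad (101)$$

we get

$$J\left(\psi_0 - \frac{\Gamma}{2}y^2,\ -\frac{\Gamma}{2}y + \omega_1\right) + J\left(\psi_1 - \frac{5\Gamma}{12}y^3,\ \nabla^2\psi_0\right) = 0 \qquad (102)$$

Since $\nabla^2\psi_0 = q\,\mathcal{H}(a - \chi)$, we see that

$$J\left(\psi_0 - \frac{\Gamma}{2}y^2,\ -\frac{\Gamma}{2}y + \omega_1\right) = -q\,\delta(a - \chi)\,\frac{\partial}{\partial\zeta}\left(\psi_1 - \frac{5\Gamma}{12}y^3\right) \qquad (103)$$

4 In order to derive these conditions, we use the orthogonality relation between the Chebyshev polynomials

$$\int_{-1}^{1} \frac{T_n(\zeta)\,T_m(\zeta)}{\sqrt{1 - \zeta^2}}\,d\zeta = \frac{\pi}{2}\,\delta_{m,n} \quad \text{if } n \neq 0 \qquad (96)$$

$$\phantom{\int_{-1}^{1} \frac{T_n(\zeta)\,T_m(\zeta)}{\sqrt{1 - \zeta^2}}\,d\zeta} = \pi\,\delta_{m,n} \quad \text{if } n = 0 \qquad (97)$$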


This equation suggests the ansatz $\omega_1 = q\,\eta(\zeta)\,\delta(a - \chi) + \omega_2$, so that we have the condition


dr)  d 
d(  dx 


x=a 


=  0 


(104) 


which  can  be  integrated  along  the  boundary  to  yield 


v(0^  (to  -  |y2)  _  +  Ma, 0  -  j|r/3(o2  -  i)3/2(i  -  C2)3/2  =  ^ 

The  equation  for  U2  is 


which  implies 


(105) 


(106) 


(107) 


As in the far field solution, we must choose the function G to represent the presence of a
vortex. Ideally, the function F should be the linear continuation of G when $\psi \ll \psi_K$. As a
first guess, we chose to take simply G = 0, so that

$$\omega_2 = \frac{\Gamma}{2}y \qquad (108)$$


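We now have to express the stream function $\psi_1$ as a function of $\eta$. This can be done by
solving the equation

$$\nabla^2\psi_1 = 2yq\,\mathcal{H}(a - \chi) + q\,\eta(\zeta)\,\delta(a - \chi) + \frac{\Gamma}{2}y \qquad (109)$$

Set $\psi_1 = \phi_1 + \phi_2 + \phi_3$, where the $\phi_i$ satisfy respectively

$$\nabla^2\phi_1 = q\,\eta(\zeta)\,\delta(a - \chi) \qquad (110)$$

$$\nabla^2\phi_2 = \frac{\Gamma}{2}y \qquad (111)$$

$$\nabla^2\phi_3 = 2qy\,\mathcal{H}(a - \chi) \qquad (112)$$

In all three cases we will have to solve the homogeneous equation $\nabla^2\phi = 0$. This has already
been done in the zeroth order case, and the result is

$$\phi_h^{out} = \sum_{n=0}^{\infty}\left(a_n^{out}R_n(\chi) + b_n^{out}S_n(\chi)\right)T_n(\zeta) \qquad (113)$$

outside the vortex, where the singular solution must be kept, and

$$\phi_h^{in} = \sum_{n=0}^{\infty} a_n^{in}\,T_n(\chi)\,T_n(\zeta) \qquad (114)$$

inside the vortex. For $\phi_2$ and $\phi_3$, the special solutions are easy to find and we get

$$\phi_2 = \phi_h + \phi_{2,s} = \phi_h + \frac{\Gamma}{12}y^3 + c_1 y + c_0 \qquad (115)$$

$$\phi_3 = \phi_h + \phi_{3,s} = \phi_h + \left(\frac{q}{3}y^3 + d_1 y + d_0\right)\mathcal{H}(a - \chi) \qquad (116)$$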


Note that the solutions are divergent for large y, and that the true solution is obtained
by matching the near-vortex solution to a far field, wave-like solution. Inside the vortex,
however, the solutions must be regular.

To summarize, renormalizing the coefficients $a_n$ and $b_n$, we have

$$\psi_1^{in} = \sum_{n=0}^{\infty} a_n^{in}\,T_n(\chi)\,T_n(\zeta) + \phi_{2,s} + \phi_{3,s}$$

$$\psi_1^{out} = \sum_{n=0}^{\infty}\left(a_n^{out}R_n(\chi) + b_n^{out}S_n(\chi)\right)T_n(\zeta) + \phi_{2,s} \qquad (117)$$

where by definition, $R_0(\chi) = 1$. The continuity of the function across the boundary of the
vortex implies that

$$\frac{\pi}{2}a_n^{in}T_n(a) + \int_{-1}^{1}\frac{\phi_{3,s}(a, \zeta)}{\sqrt{1 - \zeta^2}}\,T_n(\zeta)\,d\zeta = \frac{\pi}{2}\left(a_n^{out}R_n(a) + b_n^{out}S_n(a)\right) \qquad (118)$$

for n > 0 and

$$\pi a_0^{in} + \int_{-1}^{1}\frac{\phi_{3,s}(a, \zeta)}{\sqrt{1 - \zeta^2}}\,d\zeta = \pi\left(a_0^{out} + b_0^{out}S_0(a)\right) \qquad (119)$$

for n = 0. For $\phi_1$, the function must be continuous across the boundary, but the derivative
has a jump given by equation (110). Integrating (110) across the boundary, we get

$$\frac{1}{f^2}\,\frac{a^2 - 1}{a^2 - \zeta^2}\left[\frac{\partial\phi_1}{\partial\chi}\right]_{a_-}^{a_+} = q\,\eta(\zeta) \qquad (120)$$

Let's write $\eta(\zeta) = \sum_{n=0}^{\infty}\eta_n T_n(\zeta)$. The matching condition of the derivatives therefore implies
that, for n > 0

$$\left(a_n^{out}R_n'(a) + b_n^{out}S_n'(a)\right) - a_n^{in}T_n'(a) = q\,\eta_n \qquad (121)$$

and for n = 0,

$$b_0^{out}S_0'(a) = q\,\eta_0 \qquad (122)$$

The coefficients $\eta_n$ can actually be determined self-consistently using equation (105),
provided we know $a_n$ and $b_n$ for all n. If we truncate the system at the order N - 1, there
are in total 3N + 3 coefficients to solve for, and 2N matching conditions. The remaining
coefficients are given by the matching of this solution to a far field.


3.3  Matching  of  the  far  field  to  the  vortex  solution. 


The behaviour of the far-field is mostly determined by the total vorticity Q of the vortex patch
(with the exception of the axisymmetric component). In order to be consistent between the
far-field and the close-field, we require that $Q = \epsilon^2 q$, since we assumed the size of the vortex
patch to be of order $\epsilon$. The aim of this section is more to assess whether such a matching
is possible rather than to perform it. The actual matching, as we shall see, can only be done
numerically, and will be the aim of future work.

In order to do this matching, it is necessary to study the behaviour of the inner solution
for $\chi \to \infty$ and the outer solution as $(X, Y) \to 0$. The elliptical coordinate system asymptotically
tends to the polar coordinate system as $\chi \gg 1$. Indeed, we then have $r^2 = x^2 + y^2 \approx f^2\chi^2$
and $\zeta \approx \cos\varphi = x/\sqrt{x^2 + y^2}$. Also, we use the property that $T_n(\cos\varphi) = \cos(n\varphi)$, and
that $\chi + \sqrt{\chi^2 - 1} \approx 2\chi$. As a result, for $\chi \gg 1$, the inner solution tends to


pinner 


=  €2  cos {rup)  +  e3  [ogut  +  6gut  In  (x  +  Vix2  ~  1)) 

+  5Z  (°nUt  ( j)  +  6"Ut  (y )  j  C0s(n<p)  +  <f>2,s 


(123) 


On  the  other  hand,  as  we  saw,  the  outer  solution  tends  to 


oo 

^er=a±v,outer(,y|)  +  ^  (Am,±M±1/4m,i/2(|T|) 

n— 1 

+  Bm,±W±1/Amtl/2(\Y\))  cos (nX)  (124) 

where the ± sign refers to the difference in the Y > 0 and Y < 0 branches. All the coefficients
$A_{m,\pm}$ and $B_{m,\pm}$ are uniquely defined, with the exception of the m = 0 mode where we
imposed some additional boundary conditions to determine them. These may be taken as
free parameters if necessary to perform the matching on to the inner solution.

Let's study the various terms that appear in the inner and the outer solutions, and that may
cause problems in the matching. The most obvious term is the axisymmetric term, which
has the main component $(\Gamma/12)\epsilon^3 y^3$ in the inner, and that can be shown to behave as
$c_0 + c_1 Y + c_2 Y^2 + c_3 Y^3 + \ldots$ + logarithmic terms in the outer. The $Y^3$ terms can be matched,
since we can choose the coefficients $c_3$ on either side of Y = 0 to be $\Gamma/12$. This is possible
since we had the freedom of varying the boundary conditions on the axisymmetric mode to fit
this requirement. Next, we must fit the logarithmic terms. The main logarithmic dependence
in the inner comes from the $O(\epsilon^2)$ term. As we take $y \to \infty$, this term can be assimilated to
the contribution from a point vortex only in the outer region. We expect this term to match
exactly onto the outer solution for a point vortex only, which has been studied in Section
3.1. Finally, it can be shown that the remaining difference between the point vortex case
and the vortex+waves case is non-singular, so that this could possibly be matched onto the
$O(\epsilon^3)$ term in the inner. This last matching would yield the coefficients $a_n$ and $b_n$, and
therefore determine the shape of the boundary by determining $\eta$.

4  Conclusion 

In an attempt to understand the dynamics of vortices in accretion flows, we have been looking
for steady state solutions of such a system, since the existence of stable steady states might
be the reason for the observed longevity of the vortices. The first part of the project was a simple
attempt at finding such solutions using the rather crude assumption of a constant vorticity
field, which is only truly justified in the case of small vortex patches. This assumption allowed
us to consider top-hat vortex solutions, and study their steady state shapes. The second
part of the project was an attempt at dropping this assumption. In that case, it has been
shown that a general stationary lee wave solution must be added to the vortex solution in
order to satisfy the vorticity equation. This problem can only be solved asymptotically in
two limits: far from the vortex, it is possible to find linearized solutions. Closer to the vortex,
an expansion in the small parameter $\epsilon$, which is really the ratio of the size of the vortex to
the distance to the center of the shear flow, yields results very similar to the first section:
to zeroth order, we recover the elliptical vortex solution, and to first order, the deformation
of the vortex matches onto the "background flow", which consists of the Keplerian flow and
the lee waves. The possibility of the matching between the two solutions has been considered,
and will be the purpose of future work.

Some analysis of a two-dimensional
double diffusion experiment


David  Osmond 


1  Introduction 

Convection  driven  by  double  diffusion  occurs  when  two  properties  contributing  to  the  density 
of  a  fluid  diffuse  at  different  rates.  In  the  ocean,  the  density  of  water  is  governed  primarily 
by  heat  and  salt,  and  the  heat  diffuses  about  100  times  faster  than  salt.  It  is  now  clear  that 
double  diffusion  is  an  important  process  driving  convection  in  the  ocean,  especially  given  its 
ability  to  move  fluid  particles  across  isopycnals. 

Despite  the  primary  mechanisms  of  double  diffusion  being  formulated  back  in  1960  by 
Stern  [1]  as  he  considered  the  ’perpetual  salt  fountain’  experiment  conceived  by  his  colleagues 
a  few  years  earlier  [2],  there  remains  a  great  deal  to  be  learnt  on  the  subject.  The  two  simplest 
examples  of  double  diffusive  convection  are  salt  fingers  and  diffusive  layers.  Both  types  involve 
a  stable  stratification  of  one  component,  and  an  unstable  stratification  of  the  other,  although 
the  overall  density  field  is  always  stable.  Fingers  occur  when  the  faster  diffusing  component 
is  stably  stratified.  If  we  consider  the  heat-salt  situation  present  in  oceans,  salt  fingers  occur 
when  the  salt  is  unstably  stratified,  so  there  is  hot  salty  water  overlying  cold  fresh  water.  If  a 
perturbation  moves  a  fluid  parcel  downwards,  it  finds  itself  in  cooler  and  fresher  surroundings 
than  was  previously  the  case.  It  loses  both  heat  and  salt  to  the  surrounding  fluid;  however, 
since  temperature  diffuses  more  rapidly,  it  experiences  a  net  increase  in  density  and  thus 
continues  its  downwards  motion.  The  surrounding  fluid  gains  heat  from  this  descending 
finger,  and  it  in  turn  becomes  lighter  and  moves  upwards.  Eventually  the  region  becomes 
filled  with  fingers  moving  in  alternating  upwards  and  downwards  directions. 
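The salt-finger mechanism described above can be caricatured with a displaced parcel whose heat and salt anomalies relax towards the surroundings at different rates. The sketch below is purely illustrative: the linear relaxation law, the initial anomalies `aT0` and `bS0` (already multiplied by the expansion and contraction coefficients), and the rate `k_T` are assumptions, and only the factor of about 100 between the two rates is taken from the text.

```python
import math


def density_anomaly(t, aT0=1.0, bS0=0.8, k_T=1.0, ratio=100.0):
    """Relative density anomaly of a parcel displaced downwards from warm salty water.

    aT0 and bS0 are the initial thermal and saline contributions (illustrative values);
    aT0 > bS0 means the parcel starts lighter than its new surroundings, as required
    for a statically stable density field. Heat relaxes `ratio` times faster than salt.
    """
    k_S = k_T / ratio
    return -aT0 * math.exp(-k_T * t) + bS0 * math.exp(-k_S * t)


def crossover_time(aT0=1.0, bS0=0.8, k_T=1.0, ratio=100.0):
    """Time at which the parcel becomes denser than its surroundings."""
    k_S = k_T / ratio
    return math.log(aT0 / bS0) / (k_T - k_S)
```

With these numbers the anomaly is negative at t = 0 (the parcel would return upwards) but turns positive after a fraction of the thermal relaxation time, because the parcel has lost its heat while keeping most of its salt. This is the net increase in density that lets the finger keep sinking.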

Diffusive  layers  result  when  the  slower  diffusing  substance  is  stably  stratified.  Thus  in 
the ocean they occur when cool fresh water overlies hot salty water. Temperature diffuses
faster than salt, so the bottom layer is heating the top layer, thus driving convection in a
similar manner to Rayleigh-Bénard convection. The top layer also cools the bottom, which
also  drives  convection.  The  convection  tends  to  homogenise  the  fluid  above  and  below  the 
diffusive  interface  so  that  a  very  sharp  interface  results. 

These  two  processes  are  primarily  one-dimensional.  The  substances  in  the  mean  state 
have  gradients  only  in  the  vertical  direction.  The  situation  becomes  more  complicated  when 
there  are  gradients  in  both  horizontal  and  vertical  planes,  and  the  physics  of  this  process  are 
less  well  understood. 

In 1996 Turner and Veronis resumed work on an experiment Turner had been considering
for a long time, and which was designed to look at one example of the two-dimensional double
diffusive problem. Veronis' suggestion was to consider the simplest configurations possible,
and here we consider one of them.

Figure 1: The experimental set-up. Salt and sugar are slowly pumped into the left and right
hand side respectively of a long tank. Both source rates are 5 ml per minute. The volume of
fluid in the tank is kept constant by using a constant head overflow device in the very center
of the tank.


2  The  Experiment 

A long thin tank (1820 x 80 x 120 mm) is constructed with an inlet at the center of each
end, and a constant pressure head outlet at mid height in the center of the tank. It is
pictured in Figure 1. The tank is filled with a 50-50 mixture of salt and sugar with a density
$\rho = 1100$ kg/m$^3$. Sugar, rather than heat, is chosen as the second diffusing component,
as it does not diffuse through the side walls of the tank. Sugar has a diffusivity about 1/3 that of
salt, and so in this system plays the role that salt does in the ocean, while the salt plays the
part of heat in the real ocean.

The experiment commences by slowly pumping a salt solution with density $\rho = 1100$ kg/m$^3$
through the inlet at the left end of the tank, and a sugar solution with the same density
through the right hand inlet. Both flow rates are very close to 5 cm$^3$/min. Initially, when
the salt emerges from the source at the left inlet, it is much saltier than the fluid in the tank,
and thus salt diffuses out of it. Sugar also diffuses from the fluid in the tank into the salt
plume, but at a slower rate. The plume gets lighter as salt diffuses from it, while a sheath of
fluid surrounding the plume gets denser as the salt from the plume diffuses into it. Thus the
plume separates into a light core which convects up towards the surface, and a dense sheath
which sinks towards the base. The opposite process occurs at the sugar source, so that the
core sinks and the sheath rises.

As time continues, the density difference between the top and bottom of the tank
increases. The rate of increase slows, however, and after some time (a few days in this
experiment) a steady state density field is formed. Active double diffusive convection still
continues, although it does not alter the density field significantly. To first order,
the steady state convection consists of a region of fingering in the top left quadrant of the
tank above the salt plume, and again in the lower right quadrant beneath the sugar source.
In the top right and bottom left quadrants, diffusive layers are visible.

Figure 2: Sugar, salt and density profiles of the left and right hand sides of the experimental
tank. Salt and sugar concentrations are given in units of density.

The steady state sugar, salinity and density profiles are shown in Figure 2. There are
distinct  differences  between  the  left  and  right  hand  side  of  the  tank,  as  would  be  expected  from 
the  antisymmetric  convective  patterns,  so  they  are  profiled  separately.  The  most  dominant 
feature  of  the  profiles  is  the  rapid  increase  of  density  at  mid  height.  In  particular,  in  both  the 
left  and  right  hand  sides  of  the  tank  most  of  this  increase  has  arisen  from  the  sugar  having 
settled  predominantly  to  the  bottom  half  of  the  tank.  In  contrast,  the  salt  is  marginally 
unstable  in  both  sides  of  the  tank,  having  slightly  higher  concentrations  in  the  top  half  than 
in  the  bottom,  and  rapidly  changing  extrema  at  mid  height. 

Looking more carefully at the profiles, we can broadly divide each side of the tank into
four layers. Starting with the left hand side, on top we have fairly constant concentrations
for about 50 mm. As mentioned earlier, fingers were observed in this region, which in this
experiment are associated with a stable salt gradient and an unstable sugar gradient. There
are some indications of these gradients, although they are clearly very small compared to
most of the other features.

The next 10 mm contain rapid increases of both salt and sugar, thus leading to a very
stable density profile. This acts as a very strong barrier to any kind of vertical convection
through this layer, and indeed one would expect only pure diffusion to transfer properties
from one side to the other. Measurements of the profile from Figure 2 lead us to estimate
the diffusive salt flux to be of order 0.14 mg/s, while the sugar flux is about 0.06 mg/s.
The rate at which both salt and sugar are being pumped into the tank through the sources
is approximately 8.33 mg/s, or about 60 and 150 times larger than the pure salt and sugar
diffusive fluxes respectively.

From 50-60 mm above the bottom of the tank there is a very stable sugar gradient and
an unstable salt gradient. We associate this with a strong diffusive layer, and indeed that is
what is seen in the experiment. In the lowest 50 mm the sugar concentration gently increases
and the salt decreases downward, as in the diffusive layer above.

The layers are reversed in the right hand side of the tank. There is a region of weak
diffusing layers on top of a sharp diffusive interface, followed by a layer stable in both
properties, while weak fingering occurs in the lowest layer. The level of the sources and sink
corresponds very closely to the interface between the strong diffusive layer and the stable layer.

As mentioned earlier, the dominant feature of the profiles is that most of the
sugar is in the bottom half of the tank, while the salt is marginally unstable. This is
very similar to the profile one would expect if one had run a diffusive layer experiment by
placing a salt solution above a sugar one. The unstable salt stratification drives the diffusive
layers until the salt is nearly evenly distributed between the upper and lower halves, in the
process raising some of the sugar, but not as much as the salt that is lowered. Diffusive layers
do a better job of reducing the potential energy of a fluid than do fingers, so from energy
considerations it is not surprising that the density field resembles that of a diffusive layer
experiment rather than a finger experiment. But given that both diffusive layers and fingers
are active in this experiment, it is not immediately clear from a dynamic viewpoint why
the diffusive layers dominate the concentration profiles.

Given that the sugar travels upwards through the diffusive layers, and downwards through
the fingers, we present Figure 3 as a simplistic picture of the salt and sugar pathways through
the tank.

Figure 3: Simplistic picture of salt and sugar pathways through the tank (one panel for the
salt pathways, one for the sugar pathways).


One aspect of double diffusion experiments that has been looked at quite closely is the
ratio of the fluxes through either the fingers or the diffusive layers. For salt-sugar fingers the
ratio of salt to sugar fluxes is around 0.9, provided the density ratio, $R_\rho = \alpha T_z / \beta S_z$, is not
too close to one. For salt-sugar diffusive layers, the ratio of sugar to salt fluxes is around 0.6,
again provided the density ratio is not too close to one. Given that we have four dominant
flux pathways indicated in Figure 3, and four diffusive regions, we may construct a series
of 4 equations with 4 unknowns. Alas, the 4 equations are not independent, nor are they
even consistent, so there is no solution other than the trivial zero solution. We may illustrate
this through an analogy with two connected water wheels. One water wheel is powered by
a descending sugar solution, and it in turn raises a salt solution. There is friction in the
system, so that it can only raise 9 kg of salt for every 10 kg of sugar that falls through it. We
shall call this the finger wheel. The second water wheel is powered by the descending salt
solution, power which it uses to raise the sugar solution. This water wheel has a greater friction,
so that it can only raise 6 kg of sugar for every 10 kg of salt that powers it. This wheel is
called the diffusive wheel. The two wheels are connected so that the salt raised by the finger
wheel powers the diffusive wheel, which in turn raises the sugar to drive the finger
wheel. As the wheels are not 100% efficient, the system slows to a halt. Clearly we need
some other mechanism to get either salt or sugar or both to the top of the system to drive
the interconnected components so that they run continuously.
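
The run-down of the two undriven wheels can be sketched in a few lines of Python. This is only an illustration of the analogy: the efficiencies 0.9 and 0.6 are the values quoted in the text, while the function name, the starting mass of 10 kg and the number of cycles are arbitrary choices made here.

```python
# Water-wheel analogy without any external driving.
FINGER_EFFICIENCY = 0.9     # kg of salt raised per kg of sugar that descends
DIFFUSIVE_EFFICIENCY = 0.6  # kg of sugar raised per kg of salt that descends


def undriven_cycles(initial_sugar_kg, n_cycles):
    """Return the sugar mass back at the top after each full circuit."""
    sugar = initial_sugar_kg
    history = []
    for _ in range(n_cycles):
        salt_raised = FINGER_EFFICIENCY * sugar      # finger wheel
        sugar = DIFFUSIVE_EFFICIENCY * salt_raised   # diffusive wheel
        history.append(sugar)
    return history


history = undriven_cycles(10.0, 10)  # hypothetical 10 kg start, 10 circuits
for cycle, mass in enumerate(history, start=1):
    print(f"circuit {cycle:2d}: {mass:.4f} kg of sugar returned to the top")
```

Each circuit returns only 0.9 x 0.6 = 0.54 of the sugar that started it, so the mass in circulation decays geometrically, which is the sense in which the system slows to a halt.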

Figure 3 hints at one possibility. In the early stages of the experiment, while the tank
was still close to being homogeneous, the plumes rapidly split into a core and sheath, one
part rising up to the top of the tank, and the other descending to the bottom. These
are represented by the wiggly lines in Figure 3. In the final steady state, this process is not
nearly as obvious, but nevertheless there are possible signs of it still occurring. Another
possibility arises from the fact that most of the diffusive layers are inclined at an angle. This can be
explained through the experimental evidence that fluid is moving parallel to these interfaces.
Fluid immediately underneath the diffusive layer is continually getting denser, and so as it
moves laterally underneath the interface, it also tends to sink, thus causing the interface
to slope downwards. The fluid above the interface is continually getting lighter, thus it
must be moving in the opposite direction to the fluid underneath the interface so that in its
direction of motion the interface is sloping upwards to accommodate its increased buoyancy.
This advection can transport sugar or salt vertically (as it travels horizontally), without ever
crossing the diffusive layer and thus without being constrained by the flux transport ratios.


3  Theory 


Before delving into the theory of diffusive fingers and layers, let us continue with the coupled
water wheels analogy. For the wheels to turn, we require an additional driving force to
overcome the friction of the system. Let us derive a formula for the ratio of the driving
flux to the flux transported by the wheels.

Let us define $F_{T1}$ to be the salt flux indicated in Figure 3 for the path that travels from
the source, up through the fingers, down through the layers, and then out through the outlet.
$F_{S1}$ is the corresponding value for the sugar flux. The two wiggly lines flowing upwards from
the sources represent the driving terms $F_{T2}$ and $F_{S2}$. The driving salt flux $F_{T2}$ bypasses the
fingers, using a different mechanism to get to the top, but then joins up with the salt flux
$F_{T1}$ in traveling down through the layers. Similarly, the driving sugar flux $F_{S2}$ bypasses the
layers, but joins up with $F_{S1}$ to pass through the fingers. Let $\gamma_f \approx 0.9$ be the ratio of the
salt to sugar flux through the fingers, and $\gamma_d \approx 0.6$ be the ratio of the sugar to salt flux
through the diffusive layers. Thus we have the following relationships:

$$\gamma_f = \frac{F_{T1}}{F_{S1} + F_{S2}}, \qquad \gamma_d = \frac{F_{S1}}{F_{T1} + F_{T2}}.$$

It is clear that in all situations, the driving flux is able to generate more convection than
itself in the fingers - diffusive layers system; however the ratio is not more than about two.
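
The bound of about two can be checked numerically. The sketch below assumes the two flux-ratio relations take the form $F_{T1} = \gamma_f (F_{S1} + F_{S2})$ and $F_{S1} = \gamma_d (F_{T1} + F_{T2})$, with $\gamma_f = 0.9$ and $\gamma_d = 0.6$ as quoted in the text; the function name and the unit driving fluxes are hypothetical choices made for illustration.

```python
def circulating_fluxes(f_t2, f_s2, gamma_f=0.9, gamma_d=0.6):
    """Solve the two flux-ratio relations for the circulating fluxes.

    Assumed relations:
        F_T1 = gamma_f * (F_S1 + F_S2)   # salt raised through the fingers
        F_S1 = gamma_d * (F_T1 + F_T2)   # sugar raised through the layers
    """
    loop_gain = gamma_f * gamma_d  # fraction surviving one full circuit
    f_t1 = gamma_f * (gamma_d * f_t2 + f_s2) / (1.0 - loop_gain)
    f_s1 = gamma_d * (f_t1 + f_t2)
    return f_t1, f_s1


# Unit driving by salt alone, then by sugar alone.
salt_driven = circulating_fluxes(f_t2=1.0, f_s2=0.0)
sugar_driven = circulating_fluxes(f_t2=0.0, f_s2=1.0)
print("salt-driven  (F_T1, F_S1):", salt_driven)
print("sugar-driven (F_T1, F_S1):", sugar_driven)
```

Under these assumptions every circulating flux lies between one and two times the driving flux, the largest being the salt flux raised through the fingers when sugar alone drives the system.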

3.1  Salt Fingers

The linear stability analysis of the perturbations that grow into salt fingers is a well-studied
problem, and reveals the wavelength of the disturbance which grows fastest. There remains
some debate as to whether in the equilibrium model the fingers remain at this width (as
proposed by Schmitt [3]), or whether they attain a different width which maximises the
buoyancy flux (see Stern [4] and Howard and Veronis [5]). Most theories appear to agree that
the salt flux through fingers scales as

$$\beta F_S \sim \frac{\kappa_T (\beta \Delta S)^2}{\alpha T_z L^2}, \qquad (6)$$

where $L$ is the buoyancy-layer scale defined as

$$L = \left( \frac{\nu \kappa_T}{g \alpha T_z} \right)^{1/4}. \qquad (7)$$

The discrepancy between most formulas comes in the treatment of $T_z$. It is the unstable
salt field which drives the fingers, and thus presumably controls the gradients in the finger
zone, so ideally one would like to rewrite the temperature gradient $T_z$ that appears in (6) in
terms of the salinity difference in order to determine the flux law solely as a function of $\beta \Delta S$.
Many people favour a 4/3 power law (see Stern [4], section 11.4), and there is experimental
evidence in support of this (Turner [6]). However, as we have profiles of both the $T$ and $S$
fields, we are able to try either formula, and see if they give consistent results. The formula
we shall use is that derived by Howard and Veronis [5] in the form of (6) given by

$$\beta F_{S1} = 0.1578 \, \frac{\kappa_T (\beta \Delta S)^2}{\alpha T_z L^2}, \qquad (8)$$

together with Stern and Turner's [7] empirically fitted 4/3 power law

$$\beta F_{S2} = C (\beta \Delta S)^{4/3}, \qquad (9)$$

where $C = 10^{-4}$ m/s.

In both cases we assume the heat flux $\alpha F_T$ is a factor of 0.91 [7] times the salt flux $\beta F_S$.
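
As a rough illustration of the 4/3 power law, the sketch below evaluates it together with the flux ratio of 0.91. The coefficient is read here as $C = 10^{-4}$ m/s and $\beta \Delta S$ is taken to be a dimensionless fractional density contrast; both are assumptions, as are the function name and the sample contrasts.

```python
STERN_TURNER_C = 1.0e-4  # m/s, assumed reading of the 4/3-law coefficient
FLUX_RATIO = 0.91        # alpha*F_T / beta*F_S through the fingers, from the text


def finger_fluxes_four_thirds(beta_delta_s):
    """Return (beta*F_S, alpha*F_T) in m/s for a dimensionless contrast > 0."""
    if beta_delta_s <= 0.0:
        raise ValueError("beta_delta_s must be a positive magnitude")
    beta_fs = STERN_TURNER_C * beta_delta_s ** (4.0 / 3.0)
    return beta_fs, FLUX_RATIO * beta_fs


# Hypothetical contrasts, chosen only to show the scaling.
small, small_t = finger_fluxes_four_thirds(1.0e-3)
large, large_t = finger_fluxes_four_thirds(8.0e-3)
print(f"beta*F_S at 1e-3: {small:.3e} m/s")
print(f"beta*F_S at 8e-3: {large:.3e} m/s  (ratio {large / small:.1f})")
```

An eightfold increase in the contrast raises the flux by a factor of $8^{4/3} = 16$, which is the sensitivity that makes accurate values of $\beta \Delta S$ important.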

3.2  Diffusive  Layers 

Turner [8] first suggested that since convection through diffusive layers is similar to
Rayleigh-Bénard convection, the formula for the heat flux should scale in a similar fashion. That
is, the nondimensional heat flux through diffusive layers, given by the Nusselt number,
$Nu_T = F_T d / \kappa_T \Delta T$, should be proportional to the Rayleigh number to the power of 1/3.
The reasoning behind this relationship is that the length scale of the convective rolls is not in
general governed by the size of the tank, and it is this power law that removes the external
length scale dependence. It follows that the heat flux is given by

$$\alpha F_T = C(R_\rho) \left( \frac{g \kappa_T^2}{\nu} \right)^{1/3} (\alpha \Delta T)^{4/3}. \qquad (10)$$

Figure 4: The geometry of the 8 box model. Left hand side, from top to bottom: box 1 (fingers),
box 2 (stable), box 3 (strong diffusive layer), box 4 (weak diffusive layers). Right hand side,
from top to bottom: box 5 (weak diffusive layers), box 6 (strong diffusive layer), box 7 (stable),
box 8 (fingers).


The function $C$ has quite a strong dependence on $R_\rho$, especially for heat-salt systems.
Shirtcliffe [9] found his experimental results fit the formula

$$C = 2.6 R_\rho^{-12.6}. \qquad (11)$$

He also provided an estimate for the flux ratio, $\beta F_S / \alpha F_T = 0.60$.
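
A minimal sketch of how the 4/3 flux law for diffusive layers and Shirtcliffe's fit might be evaluated together is given below. The coefficient $2.6 R_\rho^{-12.6}$ and the flux ratio 0.60 are taken from the text; the fluid properties (`KAPPA_T`, `NU`), the function names and the sample contrasts are assumptions made purely for illustration.

```python
G = 9.81           # m/s^2
KAPPA_T = 1.5e-9   # m^2/s, assumed diffusivity of the faster diffuser (salt)
NU = 1.0e-6        # m^2/s, assumed kinematic viscosity
FLUX_RATIO = 0.60  # beta*F_S / alpha*F_T across the layer, from the text


def shirtcliffe_coefficient(r_rho):
    """Empirical coefficient C(R_rho) = 2.6 * R_rho**(-12.6)."""
    return 2.6 * r_rho ** -12.6


def diffusive_fluxes(alpha_delta_t, r_rho):
    """Return (alpha*F_T, beta*F_S) in m/s from the 4/3 law for diffusive layers."""
    if alpha_delta_t <= 0.0 or r_rho <= 0.0:
        raise ValueError("alpha_delta_t and r_rho must be positive")
    scale = (G * KAPPA_T ** 2 / NU) ** (1.0 / 3.0)
    alpha_ft = shirtcliffe_coefficient(r_rho) * scale * alpha_delta_t ** (4.0 / 3.0)
    return alpha_ft, FLUX_RATIO * alpha_ft


# Hypothetical contrasts at a fixed density ratio.
flux_a, _ = diffusive_fluxes(1.0e-3, 2.0)
flux_b, sugar_b = diffusive_fluxes(2.0e-3, 2.0)
print(f"C(1.5) / C(2.0) = {shirtcliffe_coefficient(1.5) / shirtcliffe_coefficient(2.0):.1f}")
print(f"flux ratio for doubled contrast = {flux_b / flux_a:.3f}")
```

The first printed ratio shows how steeply the predicted flux falls as $R_\rho$ increases, so small errors in the measured density ratio translate into large errors in the flux.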

4  The  Flux  box  model 

As  mentioned  previously  in  section  2,  the  experimental  profiles  drawn  in  Figure  2  suggest 
the  tank  can  be  divided  up  into  eight  primary  regions,  four  on  each  side.  We  draw  them  in 
Figure  4. 

Table 1 provides all the data necessary to solve the flux equations given in the preceding
section. To get the total flux of either solute, we must multiply by the area of the region
where the fingering or diffusive layer is present. We assume this area is constant for all six of
the interfaces we consider, and non-dimensionalise it by half the actual physical area $A$ of the
tank, writing the non-dimensional area as $A_f$. We then non-dimensionalise the fluxes by
the input salt and sugar flux, and the answers are listed in Table 1.

One clear result is that the flux through box 1 is much smaller than any of the other
fluxes. This results from the extremely small unstable sugar gradient which is driving it.
There is a large amount of uncertainty in the value for $\beta \Delta S$; indeed it may be up to a factor
of three bigger than the best fitting line used to generate the value listed. This would increase
the values of $F_{S1}$ and $F_{T1}$ by a factor of 9, but they would still remain negligible compared
to the other terms.
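
The factor of 9 follows from the quadratic dependence of the Howard and Veronis form on $\beta \Delta S$. The short sketch below compares it with the growth a 4/3 power law would give for the same uncertainty; the variable names are hypothetical.

```python
UNCERTAINTY_FACTOR = 3.0  # possible underestimate of beta*dS quoted in the text

# Flux growth if the flux scales as (beta*dS)**2.
quadratic_growth = UNCERTAINTY_FACTOR ** 2

# Flux growth if the flux scales as (beta*dS)**(4/3).
four_thirds_growth = UNCERTAINTY_FACTOR ** (4.0 / 3.0)

print(f"quadratic law:  x{quadratic_growth:.1f}")
print(f"4/3 power law:  x{four_thirds_growth:.2f}")
```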

Another clear result is that nearly all the fluxes are much larger than physically reasonable.
Although we have not substituted in a value for $A_f$, we would expect it to be of order one,
meaning that the entire left hand side of the tank is fingering in the top layer, diffusing in the
bottom, and vice-versa in the right hand side. The dimensionless flux values listed imply
the dimensional fluxes are often over 100 times greater than the input flux. Referring to
our previous water wheel analogy, a small amount of driving can generate fluxes through
the coupled diffusive system greater than the driving term, but by no more than a factor of
two. Clearly a factor of 100 or more is out of the question, even if it were possible for the
entire input fluxes to reach the top of the tank by some unidentified mechanism to become
the driving flux. Thus there must be a problem with our application of the theory. There
are a few obvious suggestions. Firstly, the formula by Howard and Veronis was derived for
two components which have vastly different diffusivities. It was designed for the heat-salt system
rather than our sugar-salt system. They allowed the salt to pass through the fingers without
diffusing, so one would expect their formula to predict a larger salt flux than the sugar-salt
experiment produces. Secondly, our profiles of both the sugar and salt in most cases just
do not have the resolution required to gain accurate values of the salt and sugar contrasts
across the interfaces. The contrasts that appear in the theories refer to the contrasts that
occur across the finger or layer interfaces, while we are using instead the contrasts that occur
across the whole box. In the one dimensional theories the fluid above and below the interfaces
is usually well mixed, so this difference is not important. In our experiment, the regions are
distinctly two dimensional. This is most obvious in the experiment through the observation
that the diffusive layers are generally inclined at some angle to the horizontal. This slope is
associated with advection parallel to the interface, and we seem not to have the well mixed
regions above and below the interfaces that the theories assume. Thus the difference between
the sugar or salt contrasts across the interfaces, and the corresponding differences across the
boxes that we used to apply the theories, may be significant. To conclude, we suggest that
the simple theories derived from one dimensional models do not adequately describe this
complicated two dimensional system.

Property                   Box 1   Box 2   Box 3   Box 4   Box 5   Box 6   Box 7   Box 8
height (m)                 0.05    0.01    0.01    0.03    0.05    0.01    0.01    0.03
$\alpha \Delta T$          0.8     13.2    -19     -4      -1.4    -24.2   7       8
$\beta \Delta S$           -0.13   15.5    36      6       1.9     44.6    15      -3.5
$R_\rho$                   6       -       1.9     1.5     1.4     1.8     -       2.3
$\alpha T_z$ (m$^{-1}$)    16      1320    -1900   -133    -28     -2420   700     -267
$\beta F_{S1} / A_f$       0.7     -       -       -       -       -       -       130
$\alpha F_{T1} / A_f$      0.6     -       -       -       -       -       -       110
$\beta F_{S2} / A_f$       0.015   -       -       -       -       -       -       11
$\alpha F_{T2} / A_f$      0.013   -       -       -       -       -       -       9.7
$\beta F_{S3} / A_f$       -       -       54      130     110     20      -       -
$\alpha F_{T3} / A_f$      -       -       90      210     190     34      -       -

Table 1: Properties of the eight boxes. The fluxes have been normalised by the source input
fluxes. $A_f$ is the area of the convecting region divided by the area of half the tank. The
subscripts 1, 2 and 3 on the fluxes refer to whether they were predicted by equation (8), (9)
or (10) respectively.


5  The box model

In  the  previous  section,  we  divided  the  tank  up  into  the  eight  well  defined  regions  observed 
in  Figure  2.  In  that  model,  we  used  the  temperatures  and  salinities  found  on  the  boundary 
of  the  boxes  to  determine  the  fluxes  within  each  box.  Let  us  now  construct  a  similar  box 
model  corresponding  to  those  eight  regions,  but  now  assign  a  mean  temperature  and  salinity 
to  each  box  rather  than  boundary  values.  The  idea  in  this  model  is  to  try  to  calculate  the 
fluxes  between  the  boxes  as  opposed  to  the  fluxes  within  each  box. 

In the previous section, we found that the simple one dimensional formulas for fingering
and diffusion did not adequately describe the experimental system. Let us discard these
theories and return to the simple assumption that the flux between two boxes is proportional
to the difference in concentration between those boxes. Writing a formula for the rate of
change of salinity in box 1, we get

$$V_1 \dot{T}_1 = b_{12}(T_2 - T_1) + b_{15}(T_5 - T_1) + T_{I1} C - Q_1 T_1 = 0. \qquad (12)$$

The final equality is due to the fact that we are interested in finding the steady state
solution. The first two terms on the right hand side of this equation are the fluxes of salt
into box 1 from box 2 and box 5 respectively, where the $b$ terms are unknown flux transfer
coefficients. The third term represents a source input term. For the moment we will assume
that the salt source is able to directly inject fluid into all four boxes on the left hand side
of the tank (boxes 1-4). $C$ is the salinity of the salt source, also equal to the concentration
of sugar in the sugar source, while $T_{I1}$ is an unknown inflow coefficient. The fourth term
represents the rate of outflow of salt from box 1, where $Q_1$ represents the rate at which fluid
is leaving box 1, which has salinity $T_1$. Finally $V_1$ is the volume of box 1. We choose to use
units of kg/m$^3$ for both salt and sugar concentrations, so the transfer coefficients $b_{ij}$ have
units of m$^3$/s. It is tempting to interpret these as the volume flux between box $i$ and box
$j$; however this would only be correct if the transports were due to advection only,
which is not true. To further emphasise this, we use different transfer coefficients, $a_{ij}$, for the
corresponding sugar equations.
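
Because the concentrations are measured and the transfer, inflow and outflow coefficients are the unknowns, the steady balance for box 1 is a linear equation in $(b_{12}, b_{15}, T_{I1}, Q_1)$. The sketch below builds that row of coefficients; the function names and all numerical values are hypothetical and are not taken from the experiment.

```python
def box1_salt_row(t1, t2, t5, source_concentration):
    """Coefficients multiplying (b12, b15, T_I1, Q1) in the box 1 salt balance."""
    return [t2 - t1, t5 - t1, source_concentration, -t1]


def balance(row, unknowns):
    """Left-hand side of the steady balance; zero when the box is in steady state."""
    return sum(coefficient * value for coefficient, value in zip(row, unknowns))


# Hypothetical concentrations (kg/m^3) and coefficients (m^3/s).
row = box1_salt_row(t1=10.0, t2=12.0, t5=9.0, source_concentration=20.0)
unknowns = [2.0, 1.0, 0.5, 1.3]  # b12, b15, T_I1, Q1
print("row of A:", row)
print("balance :", balance(row, unknowns))
```

Note that the coefficients of the inflow and outflow unknowns are full concentrations, whereas those of the transfer coefficients are concentration differences, so the entries of such a row can differ greatly in size.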

Inherent in the above equation is the assumption that each box is homogeneous. Thus,
the flux of either solute to the outflow is equal to the volume flux of the fluid leaving that
box times the concentration of that solute. In addition, we assume that fluid entering each
box from the inflow has the concentration of the reservoirs, and thus the flux of either solute is
proportional to the volume flux times the concentration.

There are similar equations for the rate of change of salinity for the remaining 7 boxes, as
well as the 8 corresponding equations for sugar, and they may be found in the appendix. In
addition, there are two more conservation equations. They are simply that the sum of the
individual salt sources is equal to the flux of the salt source, $C \sum_i T_{Ii} = QC$, as is the sum
of the individual sugar sources, $C \sum_i S_{Ii} = QC$. There are also two similar conservation
equations for the sum of the outflow terms; however they are already implicitly expressed in
the previous 18 equations.

The experimental data gives us the values of the sugar and salt concentrations in the
boxes. Thus the unknowns are the 10 transfer parameters for each of sugar and salt, the 8
output flux terms $Q_i$, and the 4 input flux terms for each of salt and sugar - a total of 36
unknowns.

As  of  the  moment,  we  have  19  equations  -  just  over  half  the  number  of  unknowns  -  a 
highly  under-determined  problem.  We  can  introduce  more  equations,  for  example  by  relating 
the  transfer  coefficients  in  some  way,  or  reducing  the  number  of  unknowns  by  restricting 
inflow  and  outflow  from  some  boxes,  but  for  the  moment,  let  us  discuss  a  method  of  finding 
solutions  to  under-determined  problems,  described  by  Veronis  in  General  Ocean  Circulation 
[10]. 

Our 19 equations may be written in matrix form $Ax = b$. Here $A$ has 19 rows and
36 columns, or more generally $m \times n$, where $m$ is less than $n$. The sizes of $x$ and $b$ are $n \times 1$
and $m \times 1$ respectively.

The key to the method is to assume we can write $x = A^T f$, where $A^T$ is the transpose
of $A$, and $f$ is a yet to be determined vector of size $m \times 1$. $f$ satisfies the equation
$A A^T f = Ax = b$. Now $A A^T$ is square ($m \times m$), and if it has a non-zero determinant, we
may solve for $f = (A A^T)^{-1} b$, and thus $x = A^T (A A^T)^{-1} b$.

We have thus determined a unique solution to an under-determined problem. The apparent
paradox is explained through the writing of $x = A^T f$. This means we are writing the solution
vector $x$ as a linear combination of the $m$ vectors that make up the rows of $A$. $x$ has $n$
components, and thus defines a point in an $n$ dimensional space. We cannot reach all the
points in the $n$ dimensional space through the summation of $m$ vectors. It turns out that the
solution this method returns is a projection of the (unknown) true solution in $n$ dimensional
space onto the $m$ dimensional space defined by the $m$ rows of $A$.
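
The method can be sketched in plain Python as follows. The routine forms $A A^T$, solves for $f$ by Gaussian elimination, and returns $x = A^T f$, which is also the solution of smallest Euclidean length. The function names and the small 2 x 3 system are hypothetical and serve only as an illustration; they are not the box model equations.

```python
def solve_square(matrix, rhs):
    """Solve matrix * y = rhs by Gaussian elimination with partial pivoting."""
    n = len(matrix)
    aug = [list(matrix[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-14:
            raise ValueError("A A^T is singular: rows of A are linearly dependent")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    y = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * y[j] for j in range(i + 1, n))
        y[i] = (aug[i][n] - tail) / aug[i][i]
    return y


def minimum_norm_solution(a, b):
    """Return x = A^T (A A^T)^{-1} b for an m x n matrix A with m < n."""
    m, n = len(a), len(a[0])
    aat = [[sum(a[i][k] * a[j][k] for k in range(n)) for j in range(m)]
           for i in range(m)]
    f = solve_square(aat, b)
    return [sum(a[i][k] * f[i] for i in range(m)) for k in range(n)]


# Hypothetical system: x1 + x2 = 2 and x2 + x3 = 2.
A = [[1.0, 1.0, 0.0],
     [0.0, 1.0, 1.0]]
b = [2.0, 2.0]
x = minimum_norm_solution(A, b)
residual = [sum(A[i][k] * x[k] for k in range(3)) - b[i] for i in range(2)]
print("x =", x)
print("residual =", residual)
```

Both equations are satisfied, yet the method has selected one particular point, (2/3, 4/3, 2/3), from the line of possible solutions.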

Let us take the example of $x + y = 10$, so $A = [1, 1]$. We do not know what the true solution
is, other than that it lies on the line $y = 10 - x$, but its projection onto the one dimensional vector
space defined by the one row of $A$ is $x = y = 5$. It is clear in the formulation of the problem
that we have not treated $x$ any differently to $y$, and that is reflected in the identical values
returned. This is important, as if we distinguish between them somehow, then that can make
a great difference. For example, let us non-dimensionalise $x$ by $L$, and $y$ by $2L$, writing
$X = x/L$ and $Y = y/(2L)$. The original equation may now be written $X + 2Y = 10/L$, and
the returned solution is $X = 2/L$, $Y = 4/L$, which lies in the space defined by the vector
$(1, 2)$, the row of $A$. In terms of the original variables, the solution is $x = 2$, $y = 8$, vastly
different to the previous solution $(5, 5)$ even though the equation solved was identical. This
has important consequences for how we scale our problem. We must choose consistent scales
for all quantities, as otherwise the solution will be biased towards those quantities that were
scaled by values too large. It is clear that to use this method, if there is no reason to favour
any unknown value over any other, then it is important to reflect that in the formulation of
the equations, so that their coefficients are equal.
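
The two-variable example can be reproduced directly. For a single equation the formula $x = A^T (A A^T)^{-1} b$ reduces to each unknown being its own coefficient times $b$ divided by the sum of the squared coefficients. The function name and the value chosen for $L$ are hypothetical.

```python
def single_equation_solution(coefficients, rhs):
    """Minimum-norm solution of one equation sum(a_i * x_i) = rhs."""
    norm_squared = sum(a * a for a in coefficients)
    return [a * rhs / norm_squared for a in coefficients]


# Unscaled problem: x + y = 10.
x_plain, y_plain = single_equation_solution([1.0, 1.0], 10.0)

# Scaled problem: X = x / L, Y = y / (2 L), so X + 2 Y = 10 / L.
L = 2.0  # arbitrary scale; the recovered (x, y) does not depend on it
X, Y = single_equation_solution([1.0, 2.0], 10.0 / L)
x_scaled, y_scaled = L * X, 2.0 * L * Y

print("unscaled:", (x_plain, y_plain))
print("scaled:  ", (x_scaled, y_scaled))
```

Both pairs satisfy $x + y = 10$, but the scaling has shifted the answer from (5, 5) to (2, 8), which is the bias described in the text.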

Figure 5 shows how the solutions to the box model equations do not change dramatically
as we increase the number of equations until they equal the number of unknowns. Plotted
are the inflow, outflow and cross box salt and sugar flux transports. The fluxes have been
normalised so that the sum of each of the salt and sugar inflows is 100, as is the sum of
each of the outflows. In picture (a), only the 19 equations listed in the appendix have been
used. The main point to note in this picture is that the horizontal transports are an order of
magnitude bigger than the vertical transports. In addition, the inflows and outflows to the
two middle layers are also an order of magnitude larger than the corresponding values for the
upper and lower layers. As a result, there is very little transport of sugar or salt occurring
in the upper and lower layers. In (b), there are an additional 10 constraints, consisting of
restricting the outflow from eight boxes to four, and adding 6 vertical dynamical constraints.
These are based on symmetry arguments and are $a_{12} = a_{78}$, $b_{12} = b_{78}$, $a_{23} = a_{67}$, $b_{23} = b_{67}$,
$a_{34} = a_{56}$ and $b_{34} = b_{56}$. Notice the fluxes are very similar to those plotted in (a). In (c),
the solution is exact, as the number of unknowns is the same as the number of equations. To
achieve this match, we have had to restrict the inflows to just the 3 boxes indicated, and the
outflows to the two boxes indicated. We feel these constraints are too harsh, and we provide
the result only to show that the exact solution is not too dissimilar to those plotted in (a) and
(b).

Figure 5: Salt and sugar fluxes given by the solution to the box model equations. The fluxes
have been normalised so that the sum of each of the inflows is 100, as is the sum of the
outflows. In (a), only the original 19 equations are solved. In (b), there are an additional 10
equations, corresponding to limiting the outflows to the middle two layers, and applying the
six vertical dynamic constraints. (c) is an exact solution, as it contains the same number of
unknowns as equations. It differs from (b) through the inflows being limited to the 3 boxes
indicated, while the outflows have been limited further to just the two middle layers.

Figure 6: Fluxes of salt and sugar given by the solutions to the box model equations. Fluxes
normalised as in Figure 5. In (a), the coefficients of the inflows have been reduced from $C_0$ to
$C_0/50$. In (b) the source inputs are limited to the top left and bottom right hand boxes. In
both cases the outflows are limited to the middle two boxes on each side, and the six vertical
dynamical constraints described in the text are applied.

Our  reason  for  the  unexpectedly  small  fluxes  in  the  upper  and  lower  layers,  particularly 
in  Figure  5  (a)  where  we  have  not  forbidden  inflow  or  outflow  to  the  top  and  bottom  layers, 
is  that  we  have  formulated  our  original  19  equations  in  a  manner  that  is  biased  against  fluxes 
in  these  regions.  This  is  due  to  assuming  the  input  source  enters  all  boxes  at  the  source 
concentration.  This  is  a  reasonable  assumption  for  the  middle  two  layers,  but  not  so  for  the 
upper  and  lower  layers.  The  main  reason  for  allowing  inflow  and  outflow  from  the  upper  and 
lower  layers  is  to  allow  some  mechanism  to  drive  the  diffusive  ’water  wheels’.  While  we  have 
not  specified  what  that  mechanism  is,  although  we  suspect  it  is  due  to  vertical  transport 
along  the  sloping  diffusive  layers,  it  is  highly  improbable  that  the  flux  from  the  source  to 
the  upper  and  lower  layers  via  this  mechanism  arrives  with  the  concentration  of  the  inflow. 
We have seen in the examples provided earlier that if we non-dimensionalise a term in the
under-determined system of equations by a large quantity, then that term dominates the
equation. Thus we expect the inflow terms to dominate the equations. By conservation,
this term must be balanced by the sum of the other terms, and since the coefficient of
the outflow is the next largest, the outflow is the next most dominant term. Thus the
cross-box transport plays only a minor role in these conservation equations.
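
The dominance of a heavily weighted term can be illustrated with a toy under-determined problem. The sketch below assumes the system is solved in the minimum-norm (pseudo-inverse) sense, which is an assumption made here for illustration and is not stated in this section; the function name `term_contributions` and the coefficient values are hypothetical.

```python
import numpy as np

def term_contributions(coeffs, rhs=1.0):
    """Minimum-norm solution of the single equation sum_i c_i * x_i = rhs.

    Returns the size of each term c_i * x_i in the balance.
    """
    c = np.asarray(coeffs, dtype=float)
    # lstsq returns the minimum-norm solution when the system is under-determined
    x, *_ = np.linalg.lstsq(c.reshape(1, -1), np.array([rhs]), rcond=None)
    return c * x

# Three terms of equal weight share the balance equally
balanced = term_contributions([1.0, 1.0, 1.0])
# One coefficient scaled up by 50 (hypothetical value): that term takes almost all of the balance
dominated = term_contributions([50.0, 1.0, 1.0])
```

In this toy case each term carries a share proportional to the square of its coefficient, so a single large coefficient leaves very little for the remaining terms.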

To see how changing the scaling for the input terms makes a difference, in Figure 6 (a)
we show the solution to the system of equations where the source term is now the source
concentration divided by 50. We can see in this figure that the vertical fluxes are now of a
similar order of magnitude to the horizontal fluxes, and we have convection in the upper and
lower layers. This solution nicely matches the schematic drawing of the fluxes in Figure 3.
However, it is not realistic, as we should really only rescale the input terms to the upper
and lower boxes in this way. If we were to do that, however, we would somehow have to
redistribute the lost concentration to the two middle layers, and the method by which to do
so is currently not clear.

As a final test of our box model, we try restricting the source terms to solely the upper
left and lower right boxes. The resulting fluxes are pictured in Figure 6 (b). Notice the very
large recirculations apparent in the upper and lower halves. This bears a nice similarity to the
water-wheel analogy: a driving flux can generate larger fluxes in the coupled double diffusive
regions than the driving flux itself. In this case the factor is greater than the maximum factor
of two predicted by the theory, but we have not specified any flux ratio parameters for the
finger or diffusive regions, and thus we would not expect a match. This example also helps
to explain why we see very little inflow or outflow from the upper or lower layers. It does not
take much of an inflow in these layers to drive large amounts of recirculation in the middle
layers. It seems logical that this model tends to reject these high-energy solutions when there
is a much simpler solution with no flux in the upper and lower layers.


6  Conclusion 

We have seen that a coupled system of fingers and diffusive layers cannot be sustained adjacent
to each other without some other driving mechanism to transport either salt or sugar to the
top of the system. With driving, the resulting fluxes through the fingers and diffusive layers
can be up to twice as large as the original driving term.

The one-dimensional flux laws predict fluxes through the salt fingers and diffusive layers
up to a few hundred times the source fluxes. These are much too large to be considered
possible. While the flux laws used are not without question, it is more likely that the
two-dimensional problem we are analysing is too far removed from the one-dimensional theories
for them to be of use, in addition to the experimental data being a little too sparse.

Our  box  model  only  permits  the  driving  flux  to  enter  the  top  of  the  system  at  the  same 
concentration  as  the  source.  The  large  co-efficient  of  this  source  term  dominates  the  vector 
space  of  the  possible  solution  set,  and  to  first  order  it  is  the  outflow  that  matches  this  term. 
The  cross  box  transports  play  only  a  minor  role  in  the  solutions.  Small  amounts  of  source 
flux  terms  in  the  upper  and  lower  layers  produce  large  amounts  of  recirculation  in  the  middle 
layers,  which  the  model  rejects  in  favour  of  the  less  energetic  solutions  that  contain  very  little 
input  terms  to  the  upper  and  lower  layers.  To  generate  non-negligible  amounts  of  convection 
in  the  upper  and  lower  layers,  we  require  a  theory  for  the  mechanism  behind  the  driving 
term,  which  we  may  then  use  to  scale  the  source  terms  for  the  upper  and  lower  layers. 
8 Appendix: The box model equations

The model has nineteen basic equations, comprising a sugar equation for each box,

$$\dot{S}_1 = a_{12}(S_2 - S_1) + a_{15}(S_5 - S_1) + C S_{i1} - S_1 Q_1 = 0,$$

$$\dot{S}_2 = a_{12}(S_1 - S_2) + a_{23}(S_3 - S_2) + a_{26}(S_6 - S_2) + C S_{i2} - S_2 Q_2 = 0,$$

$$\dot{S}_3 = a_{23}(S_2 - S_3) + a_{34}(S_4 - S_3) + a_{37}(S_7 - S_3) + C S_{i3} - S_3 Q_3 = 0,$$

$$\dot{S}_4 = a_{34}(S_3 - S_4) + a_{48}(S_8 - S_4) + C S_{i4} - S_4 Q_4 = 0,$$

$$\dot{S}_5 = a_{15}(S_1 - S_5) + a_{56}(S_6 - S_5) - S_5 Q_5 = 0,$$

$$\dot{S}_6 = a_{26}(S_2 - S_6) + a_{56}(S_5 - S_6) + a_{67}(S_7 - S_6) - S_6 Q_6 = 0,$$

$$\dot{S}_7 = a_{37}(S_3 - S_7) + a_{67}(S_6 - S_7) + a_{78}(S_8 - S_7) - S_7 Q_7 = 0,$$

$$\dot{S}_8 = a_{48}(S_4 - S_8) + a_{78}(S_7 - S_8) - S_8 Q_8 = 0,$$

a salt equation for each box,

$$\dot{T}_1 = b_{12}(T_2 - T_1) + b_{15}(T_5 - T_1) - T_1 Q_1 = 0,$$

$$\dot{T}_2 = b_{12}(T_1 - T_2) + b_{23}(T_3 - T_2) + b_{26}(T_6 - T_2) - T_2 Q_2 = 0,$$

$$\dot{T}_3 = b_{23}(T_2 - T_3) + b_{34}(T_4 - T_3) + b_{37}(T_7 - T_3) - T_3 Q_3 = 0,$$

$$\dot{T}_4 = b_{34}(T_3 - T_4) + b_{48}(T_8 - T_4) - T_4 Q_4 = 0,$$

$$\dot{T}_5 = b_{15}(T_1 - T_5) + b_{56}(T_6 - T_5) + C T_{i5} - T_5 Q_5 = 0,$$

$$\dot{T}_6 = b_{26}(T_2 - T_6) + b_{56}(T_5 - T_6) + b_{67}(T_7 - T_6) + C T_{i6} - T_6 Q_6 = 0,$$

$$\dot{T}_7 = b_{37}(T_3 - T_7) + b_{67}(T_6 - T_7) + b_{78}(T_8 - T_7) + C T_{i7} - T_7 Q_7 = 0,$$

$$\dot{T}_8 = b_{48}(T_4 - T_8) + b_{78}(T_7 - T_8) + C T_{i8} - T_8 Q_8 = 0,$$

plus three conservation equations,

$$Q_1 + Q_2 + Q_3 + Q_4 + Q_5 + Q_6 + Q_7 + Q_8 = 2Q,$$

$$S_{i1} + S_{i2} + S_{i3} + S_{i4} = Q,$$

$$T_{i5} + T_{i6} + T_{i7} + T_{i8} = Q.$$
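
The structure of the sixteen box balances can be sketched in a few lines of code. This is a minimal illustration, assuming the connectivity implied by the exchange coefficients (boxes 1 to 4 in one column, 5 to 8 in the other); the names `EDGES` and `box_residuals` are hypothetical and are not part of the model description.

```python
import numpy as np

# Box connectivity read off the exchange coefficients: vertical neighbours in each
# column, plus horizontal neighbours between the two columns
EDGES = [(1, 2), (2, 3), (3, 4), (5, 6), (6, 7), (7, 8),
         (1, 5), (2, 6), (3, 7), (4, 8)]

def box_residuals(conc, exchange, outflow, inflow, c_source):
    """Residual of the steady tracer balance in each of the 8 boxes.

    conc     : length-8 concentrations (sugar S_i or salt T_i)
    exchange : dict {(i, j): coefficient} over EDGES (a_ij or b_ij)
    outflow  : length-8 outflows Q_i
    inflow   : length-8 source inflows (zero where no source enters)
    c_source : source concentration C
    """
    conc = np.asarray(conc, dtype=float)
    res = c_source * np.asarray(inflow, dtype=float) - conc * np.asarray(outflow, dtype=float)
    for (i, j) in EDGES:
        flux = exchange[(i, j)] * (conc[j - 1] - conc[i - 1])
        res[i - 1] += flux   # gain of box i from box j
        res[j - 1] -= flux   # equal and opposite change in box j
    return res
```

Because each exchange term appears with opposite signs in the two boxes it connects, the cross-box transports cancel when the residuals are summed, leaving only the balance between inflow and outflow.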
1 Introduction

Experimental observations of the vertical propagation and decay of near inertial oscillations
(NIO) through the oceanic mixed layer have stimulated a desire to understand the effect of NIO
activity on mixing processes in the upper ocean. Ocean surface forcing due to the passage
of large scale wind events or storms instigates the formation of coherent NIO structures
which tend to migrate in a helical trajectory, as is evident from near surface buoy drifters.
Observations also indicate that NIO mixed layer activity eventually decays to background
levels approximately 20 days after the initial onset of the storm [1]. A major challenge to
oceanographers has been to explain the primary mechanisms responsible for the observed time
scales of NIO propagation and decay. Young and Ben Jelloul [2] hypothesized that advective
distortion by the geostrophic eddy field decreases the NIO horizontal coherence scale. From
a multiple time scale analysis, they formulate a reduced NIO equation linearized about the
geostrophic flow. This analysis effectively filters out inertial oscillations, allowing focus on
the near inertial component of the motion. Their NIO equation combines the effects of advection
by the geostrophic velocity, wave dispersion, and refraction due to the geostrophic vorticity.
Subsequent work by Balmforth et al. [3] investigated results from the NIO equation for
the case of a background geostrophic shear flow. The present study extends this work by
considering the fundamental properties of the NIO equation for the specific case of linearized
NIOs superposed on a two stage random wave model of the background geostrophic eddy
field.

We begin the paper with an overview of near inertial oscillations and describe the methodology
used to obtain the reduced NIO equation that provides the basis for the present study.
We will see that the NIO equation is characterized by a parameter termed the dispersivity,
analogous to the diffusivity associated with passive scalar diffusion processes. We then discuss
the random wave model used to represent the background geostrophic flow and some
fundamental properties associated with this type of model flow. We then study some limiting
parameter cases of the governing equation, specifically, zero and infinite dispersivity. We
present data from the numerical solution of the governing equation for a range of dispersivity
values. Finally, we take a look at the decay of energy in the large scales and conclude with
comments on directions for future research.
2 Near Inertial Oscillations (NIO)

Inertial oscillations describe fluid motions arising from a force balance between fluid inertia
and Coriolis acceleration. The reduced horizontal momentum equations, in a reference frame
rotating with the earth, are

$$u_t - f_0 v = 0 \quad\text{and}\quad v_t + f_0 u = 0, \tag{1}$$

where the subscript $t$ represents partial differentiation with respect to time and
$f_0 = 2\Omega\sin\theta$ is the inertial frequency with $\theta$ and $\Omega$ denoting the latitude
and the earth's rotation rate, respectively. Throughout the paper, we follow the convention that
$(u,v)$ describe the horizontal velocity components in the easterly ($x$) and northerly ($y$)
directions, respectively, and $z$ refers to the vertical direction. The solution to (1), assuming
constant $f_0$, is simply $U = \bar{U}e^{-if_0t}$ where $U = u + iv$ and $\bar{U}$ is the initial
velocity. The corresponding particle trajectories, $x + iy \propto e^{-if_0t}$, form closed loops.
As a simple model of the passage of a storm front, we assume that at time $t = 0$, an
instantaneous, homogeneous wind event occurs, thereby exciting the entire horizontal domain to
move with a uniform velocity of $\bar{U}$. The ensuing motions, described by (1), are referred to
as inertial oscillations. Using data from mid latitude ocean buoy drifters [1], the typical
diameter of inertial oscillations is estimated to be of the order of 5 km. In reality, however,
(1) only represents the leading order behavior of the flow; the ensuing motions are actually
near inertial oscillations that are more accurately characterized by helical type trajectories
and have a finite lifespan in the mixed layer of approximately 20 days.
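
The closed loops implied by (1) can be checked numerically. The following is a minimal sketch using the complex form of the solution; the initial speed of 0.25 m/s is a hypothetical value chosen so that the loop diameter comes out at the 5 km scale quoted above, and the function name is illustrative.

```python
import numpy as np

def inertial_trajectory(u0, f0, t):
    """Velocity and displacement for U_t = -i f0 U with U(0) = u0, where U = u + i v."""
    velocity = u0 * np.exp(-1j * f0 * t)
    # Time integral of the velocity, with the particle starting at the origin
    position = (1j * u0 / f0) * (np.exp(-1j * f0 * t) - 1.0)
    return velocity, position

f0 = 1.0e-4                 # inertial frequency in 1/s (the value of f0 used later in the text)
u0 = 0.25 + 0.0j            # hypothetical initial velocity in m/s
period = 2.0 * np.pi / f0   # one inertial period in seconds
t = np.linspace(0.0, period, 2001)
vel, pos = inertial_trajectory(u0, f0, t)

centre = -1j * u0 / f0              # centre of the circular loop
diameter = 2.0 * abs(u0) / f0       # loop diameter in metres
```

The particle returns to its starting point after one inertial period, and the loop diameter is $2|\bar{U}|/f_0$, so the observed diameter gives a direct estimate of the speed imparted by the wind event.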

Note, in the solution of (1), a constant inertial frequency $f_0$ was assumed. This assumption
ceases to be valid if a coherent fluid motion spans a large enough horizontal extent; then the
latitude difference (and, hence, the change in $f_0$) between the most northerly and southerly
points of the coherent motion can no longer be neglected. For simplicity, we neglect these
so-called $\beta$ effects in the remainder of the paper.

3  Reduced  Linearized  NIO  Equation 

The previous work of Young and Ben Jelloul [2] regarding near inertial oscillations provides
the basis for the present study. We briefly summarize the relevant points of that work here.
We begin by assuming hydrostatic, Boussinesq, inviscid, incompressible flow. The velocity is
linearized about the background geostrophic flow, which can be written compactly in terms of
a streamfunction, $\Psi(x,y,z,t) = P_g/(f_0\rho_0)$, with $P_g$ denoting the geostrophic pressure
field and $\rho_0$ the mean density. Further reduction of the linearized governing equations (not
reproduced here) is achieved through a multiple time scale analysis with the requirement that
internal waves be nearly inertial. The general dispersion relation for internal waves can be
written as $\omega^2 = (N_0^2k_h^2 + f_0^2k_v^2)/k^2$ where $\omega$ is the oscillation frequency,
$(k_h, k_v)$ denote the horizontal and vertical wavenumbers, respectively, $k^2 = k_h^2 + k_v^2$,
and $N_0$ is the characteristic buoyancy frequency associated with the vertical density
stratification of the fluid (for further discussion of internal waves, see Gill [4], p. 258). We
define a small parameter $\epsilon = (N_0K_h)/(f_0K_v)$. Physical estimates of $N_0$ in the North
Pacific and North Atlantic indicate $N_0/f_0 = O(10^2)$ [5]. Therefore, in order for
$\epsilon \ll 1$, the characteristic vertical wavelengths of the motion must be several orders of
magnitude smaller than the characteristic horizontal wavelengths, i.e. $K_v \gg K_h$. This is
entirely consistent with our use of the hydrostatic approximation (see Gill [4], p. 159 for a
discussion on the equivalence between longwave and hydrostatic approximations). The internal
wave dispersion relation then reduces to $\omega^2 \approx f_0^2(1 + \epsilon^2)$, whereby inertial
oscillations are recovered at leading order. Departures from perfect inertial oscillations become
appreciable on the slow time scale $t_s = \epsilon^2 f_0 t$.
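
The reduction of the dispersion relation can be written out in one line. This is a sketch of the intermediate step, using only the symbols defined above and assuming $k_v \gg k_h$ with $N_0/f_0 \gg 1$, so that $k_h^2/k_v^2$ is much smaller than $\epsilon^2$:

```latex
\omega^2 = \frac{N_0^2 k_h^2 + f_0^2 k_v^2}{k_h^2 + k_v^2}
         = f_0^2\,\frac{1 + \epsilon^2}{1 + k_h^2/k_v^2}
         \approx f_0^2\left(1 + \epsilon^2\right),
\qquad \epsilon = \frac{N_0 k_h}{f_0 k_v},
\qquad\Longrightarrow\qquad
\omega \approx f_0\left(1 + \tfrac{1}{2}\epsilon^2\right).
```

The frequency therefore departs from $f_0$ by an amount of order $\epsilon^2 f_0$, and the associated phase drift becomes order one only when $\epsilon^2 f_0 t$ is order one, which is the slow time scale identified above.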

In the multiple time scale analysis, the complex velocity $U$ is expanded in powers of
$\epsilon^2$, giving $U = U_0 + \epsilon^2U_2 + \cdots$, where the leading order solution $U_0$ is
simply the velocity associated with the inertial oscillations governed by (1). For convenience we
write the leading order solution as $U_0 = M_z(x,y,z,t_s)e^{-if_0t}$, which allows for trivial
integration of the incompressibility condition to obtain the leading order vertical velocity
explicitly. If we further define a new complex field $A$ such that $M = (f_0^2N^{-2})A_z$, then it
is also possible to explicitly calculate the leading order pressure by integrating the hydrostatic
equation along with the mass conservation equation. In essence, $A$ incorporates all of the
relevant physical quantities of interest. Therefore, we prefer to work solely with the dependent
variable $A$. Substituting the definition of $A$ into the leading order horizontal velocity
solution yields an expression for the demodulated velocity of the NIO,

$$u + iv = e^{-if_0t}LA, \tag{2}$$

where

$$LA = (f_0^2N^{-2}A_z)_z. \tag{3}$$

We find that the $O(\epsilon^2)$ equation contains resonant terms proportional to $e^{-if_0t}$. To
prevent related secular terms from arising in the higher order correction, we require that

$$LA_t + J(\Psi, LA) + \frac{i}{2}f_0\nabla^2A + \frac{i}{2}\nabla^2\Psi\,LA = 0, \tag{4}$$

where $\nabla^2 = \partial_{xx} + \partial_{yy}$ represents the horizontal Laplacian operator and
$J(\Psi, LA) = \Psi_x(LA)_y - \Psi_y(LA)_x$ is the Jacobian. One advantage of (4) is that the first
term on the left hand side has the direct physical interpretation of being the time rate of
change of the horizontal velocity. The vertical boundary conditions demand zero vertical velocity
at the top and bottom of the ocean, translating into

$$A_z(x,y,0,t) = A_z(x,y,-H,t) = 0, \tag{5}$$

where $H$ is the depth of the ocean. This condition follows the rigid lid approximation, which
assumes that the typical amplitude of surface waves is negligible compared to the vertical
wavelength of the propagating NIOs. Normalized horizontal boundary conditions are $2\pi$ periodic.
The initial condition depends on how one chooses to model the passage of the storm or other
instigating event. We will specify this condition later. The present study will focus on
investigating some of the fundamental characteristics of the NIO equation (4) in the specific
context of a simple random wave model of the background turbulent geostrophic eddy field.




4  Vertical  Normal  Modes 


The top and bottom boundaries of the ocean have the effect of confining wave energy to a
region of finite vertical extent. Thus, the ocean can be considered as a waveguide causing
energy to propagate along the horizontal direction. With this notion, we proceed in assuming
a solution to (4) in terms of a superposition of vertical normal modes

$$A = \sum_{m=1}^{\infty} A_m(x,y,t)P_m(z)\sigma_m, \tag{6}$$

where $m$ denotes the vertical wavenumber, $P_m(z)$ represents the eigenfunctions, and $\sigma_m$
represents the projection of the initial condition onto the vertical normal modes. Note, the
expansion in (6) relies on the assumption that the background geostrophic flow is barotropic,
i.e. $\Psi_z = 0$. Applying the differential operator $L$ to (6) gives

$$LA = L(A_mP_m) = (A_m)LP_m. \tag{7}$$

Substituting (7) into (4),

$$\frac{LP_m}{P_m} = \frac{-i(f_0/2)\nabla^2A_m}{A_{mt} + J(\Psi,A_m) + (i/2)(\nabla^2\Psi)A_m} = -R_m^{-2}. \tag{8}$$

For historical reasons [6], eigenvalues are represented as $R_m^{-2}$, where $R_m$ (dimensions of
length) symbolizes the Rossby deformation radius. From (8), we obtain a partial differential
equation for $A_m$,

$$A_{mt} + J(\Psi,A_m) + \frac{i}{2}(\nabla^2\Psi)A_m = \frac{ih_m}{2}\nabla^2A_m, \tag{9}$$

where $h_m = f_0R_m^2$ will be referred to as the dispersivity† associated with the $m$th vertical
mode. Since the initial condition of $A$ has been projected onto vertical normal modes, the
initial condition associated with each $A_m$ is simply $A_m(x,y,0) = 1$. Exact numerical values
of $h_m$ depend on the eigenvalues of (8) which, in turn, depend on the shape of the buoyancy
frequency profile, $N = N(z)$. If we assume $N$ = constant, then
$P_m(z) \propto \cos(N^2(f_0R_m)^{-2}z)$ and $R_m^2 = (N/f_0)\sqrt{H/(m\pi)}$. A constant buoyancy
frequency profile, however, is not a reasonable physical model; therefore, we choose something
slightly more realistic. In this regard, we follow the work of Gill [6]. Figure 1 shows the model
$N$ profile used here. The corresponding eigenvalues are computed numerically and plotted in
figure 2 using a value of $f_0 = 1\times10^{-4}\,\mathrm{s}^{-1}$.
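
The step from the separated form (8) to the modal equation (9) can be written out explicitly. This is a sketch that assumes the separation constant in (8) is $-R_m^{-2}$, so that the dispersivity carries the dimensions of length squared per unit time:

```latex
-\frac{i f_0}{2}\nabla^2 A_m
  = -R_m^{-2}\left[A_{mt} + J(\Psi, A_m) + \frac{i}{2}(\nabla^2\Psi)A_m\right]
\quad\Longrightarrow\quad
A_{mt} + J(\Psi, A_m) + \frac{i}{2}(\nabla^2\Psi)A_m
  = \frac{i f_0 R_m^2}{2}\nabla^2 A_m
  = \frac{i h_m}{2}\nabla^2 A_m,
\qquad h_m = f_0 R_m^2 .
```

Each vertical mode therefore obeys the same horizontal equation, and the vertical structure enters only through the single number $h_m$.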

†This is appropriate nomenclature since (9) begets a dispersion relation. Consider only the time
derivative and Laplacian terms; assuming a solution of the form $e^{i(kx+\ell y-\omega t)}$ yields
the real-valued dispersion relation $\omega = \frac{h}{2}(k^2+\ell^2)$. This is particularly
interesting since (9) "looks like" an advection-diffusion equation but, because of the $i$
multiplying the Laplacian term, a real-valued dispersion relation is obtained, analogous to a
wave equation.
Figure 1: Buoyancy frequency profile used to calculate $h_m$.

Figure 2: Dispersivity as a function of vertical wavenumber $m$.


Of interest is the relatively large range of $h$ values (over four orders of magnitude) apparent
just within the first 10 vertical modes. We expect this to play an important role in the
developing structure of $A$. Also, note that for the $N$ profile in figure 1, $h \sim m^{-2}$ as
compared to $h \sim m^{-1/2}$ for the case of $N$ = constant.

Except for the dependence of the dispersivity $h$ on the vertical wavenumber $m$, $A$ apparently
satisfies the same initial value problem regardless of the specific vertical normal mode
under consideration. Therefore, in the remainder of the paper, we drop the subscript $m$ on
$A$. Focus is placed on understanding the general behavior of the initial value problem given
by

$$A_t + J(\Psi,A) + \frac{i}{2}(\nabla^2\Psi)A = \frac{ih}{2}\nabla^2A, \qquad A(x,y,0) = 1 \tag{10}$$

for a range of parameter values $0 < h < \infty$ and a specific form of $\Psi$ detailed in section 6.
Equation (10) will be referred to as the passive scalar dispersion equation.

5  Scalar  Dispersion  Equation 

An important aspect of (10) is that the integral of $|A|^2$ over the spatial domain is conserved.
To see this, we start by writing $A$ in terms of a magnitude $R$ and phase $\theta$,

$$A = Re^{i\theta}. \tag{11}$$

Substituting (11) into (10), dividing by $e^{i\theta}$ and separating real and imaginary parts
yields evolution equations for $R$ and $\theta$, respectively,

$$R_t + J(\Psi,R) = -\frac{h}{2}\left[2\nabla R\cdot\nabla\theta + R\nabla^2\theta\right], \tag{12}$$

$$\theta_t + J(\Psi,\theta) = -\frac{\zeta}{2} + \frac{h}{2}\left[\frac{\nabla^2R}{R} - (\nabla\theta)^2\right], \tag{13}$$

where $\zeta = \nabla^2\Psi$ represents the vorticity. Multiplying (12) by $R$ and integrating over
the two dimensional spatial domain gives

$$\int_S\frac{D}{Dt}\left(\frac{R^2}{2}\right)dS = -h\int_S\nabla\cdot\left(\frac{R^2}{2}\nabla\theta\right)dS = -h\oint_C\frac{R^2}{2}\nabla\theta\cdot\mathbf{r}\,dC = -h\oint_C\frac{R^2}{2}\frac{\partial\theta}{\partial r}\,dC, \tag{14}$$

where $DR/Dt = \partial R/\partial t + J(\Psi,R)$ and $\mathbf{r}$ is the outward normal unit vector
at the boundary. In obtaining (14), we have used the product rule
$\nabla(R^2/2)\cdot\nabla\theta = \nabla\cdot((R^2/2)\nabla\theta) - (R^2/2)\nabla^2\theta$ and
the divergence theorem. Far-field boundary conditions are utilized, which translates into
$\nabla A\cdot\mathbf{r} = 0$ on the boundary; or in polar representation,
$\partial R/\partial r = 0$ and $\partial\theta/\partial r = 0$ on the boundary. Note, periodic
boundary conditions in a two dimensional box automatically satisfy the far-field conditions due to
the fact that the gradient of the function at one end of the periodic domain is exactly equal and
opposite to the gradient at the other end. Application of the far-field boundary condition to the
last expression of (14) leaves

$$\frac{d}{dt}\int_S\frac{R^2}{2}\,dS = 0, \tag{15}$$

which, since $R^2 = |A|^2$, proves our initial statement at the beginning of the section.

We contrast (15) with the case of passive scalar diffusion. The equation governing the
evolution of a scalar concentration field $c$ in a background flow can be written as

$$c_t + J(\Psi,c) = \nu\nabla^2c, \qquad c(x,y,0) = c_0, \tag{16}$$

where $\nu$ is the molecular diffusivity coefficient, in analogy to the dispersivity of the scalar
dispersion equation (10). Following the same procedure as for $A$, we find

$$\frac{d}{dt}\int_S\frac{c^2}{2}\,dS = -\nu\int_S|\nabla c|^2\,dS \le 0. \tag{17}$$

We  conclude  that  the  diffusion  process  tends  to  minimize  the  squared  scalar  concentration 
whereas  the  dispersion  process  conserves  this  same  quantity.  What  does  this  say  about  the 
generation  of  small  scales  or  the  cascade  of  energy  from  large  to  small  scales?  Does  A  exhibit 
a  Batchelor  scale/spectrum  [7]?  These  questions  have  motivated,  to  some  extent,  the  work 
herein.  At  this  point,  we  present  the  specific  streamfunction  model  used  for  the  remainder 
of  the  calculations  in  the  paper. 
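
The contrast between dispersion and diffusion can be illustrated numerically for the Laplacian terms alone. The sketch below deliberately omits advection and refraction, uses a hypothetical initial field and hypothetical values of $h$ and $\nu$, and solves each linear problem exactly in Fourier space on a $2\pi$ periodic grid; the function names are illustrative.

```python
import numpy as np

def evolve_laplacian_only(field0, coeff, t):
    """Exact solution of F_t = coeff * Laplacian(F) on a 2*pi periodic square grid.

    coeff = 1j * h / 2 gives the dispersive term of (10);
    coeff = nu gives the diffusive term of (16).
    """
    n = field0.shape[0]
    k = np.fft.fftfreq(n, d=1.0 / n)            # integer wavenumbers
    kx, ky = np.meshgrid(k, k, indexing="ij")
    k2 = kx**2 + ky**2
    # Each Fourier mode evolves as exp(-coeff * |k|^2 * t)
    return np.fft.ifft2(np.fft.fft2(field0) * np.exp(-coeff * k2 * t))

def mean_square(field):
    """Domain average of the squared magnitude."""
    return np.mean(np.abs(field) ** 2)

n = 64
x = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
X, Y = np.meshgrid(x, x, indexing="ij")
start = 1.0 + 0.5 * np.cos(3 * X) * np.sin(2 * Y)      # hypothetical initial field

dispersed = evolve_laplacian_only(start.astype(complex), 0.5j * 0.8, t=2.0)  # h = 0.8
diffused = evolve_laplacian_only(start, 0.4, t=2.0)                          # nu = 0.4
```

With an imaginary coefficient each Fourier mode only changes phase, so the mean square is unchanged, whereas with a real positive coefficient every mode with nonzero wavenumber decays.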
6 Two Stage Random Wave Model

We will investigate the behavior of (10) for the specific two dimensional, random flow field
described by the following streamfunction $\Psi$,

$$\Psi_n = \begin{cases}\sin(y + \phi_n), & 2n\tau < t < (2n+1)\tau,\\ \sin(x + \chi_n), & (2n+1)\tau < t < (2n+2)\tau,\end{cases} \tag{18}$$

for $n = 0, 1, \ldots$, where $\phi$, $\chi$ are uniform random phases between 0 and $2\pi$ and
$\tau$ represents a characteristic decorrelation time of the turbulence. A variation of this model
has been used in the past [8]. Note, we have nondimensionalized $\Psi$ by a characteristic
streamfunction $\Psi_0$ and the spatial coordinates by a characteristic horizontal wavenumber
$K_h$. The spatial domain is thus $2\pi$ periodic. In order for the model to have physical
relevance to the oceanic geostrophic eddy field, we have taken $\Psi_0 = 3000\,\mathrm{m^2/s}$ and
$K_h = 6\times10^{-5}\,\mathrm{m^{-1}}$, based on the data of [1]. Together, these give a
characteristic decorrelation time scale of $\tau^* = O(1\ \text{day})$. The velocity components,
$\mathbf{u} = (u,v)$, follow from the definition of the streamfunction as $u = -\Psi_y$ and
$v = \Psi_x$. The corresponding particle trajectories are

$$x_{n+1} = x_n - \cos(y_n + \phi_n)\tau, \qquad 2n\tau < t < (2n+1)\tau, \tag{19}$$

$$y_{n+1} = y_n + \cos(x_{n+1} + \chi_n)\tau, \qquad (2n+1)\tau < t < (2n+2)\tau. \tag{20}$$
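
The map (19)-(20) is straightforward to iterate. The following is a minimal sketch in nondimensional variables; the function names are hypothetical, and the last helper assumes the dimensional time scale implied by the nondimensionalization is $1/(\Psi_0K_h^2)$.

```python
import numpy as np

def two_stage_step(x, y, tau, phi, chi):
    """One step of the random map (19)-(20): an x sweep followed by a y sweep."""
    x_new = x - np.cos(y + phi) * tau          # stage 1, equation (19)
    y_new = y + np.cos(x_new + chi) * tau      # stage 2, equation (20), uses the updated x
    return x_new, y_new

def advect(x0, y0, tau, n_steps, rng):
    """Advect particles through n_steps steps, drawing fresh uniform phases each step."""
    x, y = np.array(x0, dtype=float), np.array(y0, dtype=float)
    for _ in range(n_steps):
        phi, chi = rng.uniform(0.0, 2.0 * np.pi, size=2)
        x, y = two_stage_step(x, y, tau, phi, chi)
    return x, y

def dimensional_time_scale(psi0, k_h):
    """Advective time scale 1 / (psi0 * k_h**2), in seconds for SI inputs."""
    return 1.0 / (psi0 * k_h**2)

# Values quoted in the text: psi0 = 3000 m^2/s, K_h = 6e-5 1/m
tau_star_days = dimensional_time_scale(3000.0, 6.0e-5) / 86400.0
```

With the quoted values the time scale comes out at roughly one day, consistent with the estimate of $\tau^*$ above.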

As is apparent, the flow model is characterized by a two stage advection process. In the
first stage, during time intervals $2n\tau < t < (2n+1)\tau$, particles are advected in the $x$
direction only for a time $\tau$; while in the second stage, during
$(2n+1)\tau < t < (2n+2)\tau$, particles are advected in the $y$ direction only for a time $\tau$.
The combination of these two advection stages constitutes a single step in the random wave model.
The total time to complete $n$ steps is then $t = n(2\tau)$. The main advantage of the two stage
random model stems from simplifications in the subsequent mathematics, as will be described later.
It is worthwhile to analyze the two stage random flow field in terms of the effect on material
line stretching and fluid particle diffusivity. We compare the two stage model of (18) with a one
stage model such as the renovating random wave model,

$$\Psi_n = \cos\left[x\cos\phi_n + y\sin\phi_n + \chi_n\right], \qquad (n-1)\tau < t < n\tau, \tag{21}$$

where advection is performed in a single stage.

6.1  Material  line  stretching 

We investigate material line stretching induced by (19-20) in the context of Lyapunov exponents.
Lyapunov exponents are used extensively in the study of nonlinear dynamical systems
as a measure of whether two initial conditions diverge exponentially in time, thus possibly
leading to the onset of chaos. The same concept is often applied to neighboring fluid particles
in the study of fluid turbulence. In this manner, the Lyapunov exponent gives some indication
of the stretching of differential line elements in the flow. In the case of advection of a
real-valued passive scalar, this can be directly related to the development of spatial gradients
in the scalar field. We will consider later whether an analogy to the complex-valued scalar $A$
exists.


6.1.1  Lyapunov  exponent:  background 

We follow the general definitions and methodology of Seydel [9] regarding the Lyapunov
exponent calculations. An initially small ring of fluid particles with initial radius $\rho_0$
deforms into an ellipse with major axis $\rho$ due to regions of localized strain in the flow.
Linearizing the flow about the origin $\rho_0 = 0$, we obtain
$\mathbf{h}_n = \mathbf{J}^n\mathbf{h}_0$ where $\mathbf{J}^n$ denotes the Jacobian matrix of
the random map given by (19-20) after $n$ steps, and $\mathbf{h}_0$, $\mathbf{h}_n$ describe the
particle positions in the original (circular) and deformed (elliptic) configurations,
respectively. Strictly, $\mathbf{h}_n$ and $\mathbf{h}_0$ must be differential vectors for the
linearization to be valid. We look for exponential stretching of the form
$\|\mathbf{h}_n\|^2 = (\|\mathbf{h}_0\|e^{Lt})^2$ where $L$ denotes the Lyapunov exponent.
Rearranging and taking the limit as $t \to \infty$ leads to

$$L = \lim_{n\to\infty}\frac{1}{4n\tau}\ln\left(\frac{\|\mathbf{J}^n\mathbf{h}_0\|^2}{\|\mathbf{h}_0\|^2}\right), \tag{22}$$

where we have substituted in $t = 2n\tau$ with $2\tau$ denoting the decorrelation time associated
with the two stage random wave model of (19-20). Since $\mathbf{J}^n$ describes a random process,
we ensemble average (denoted as $\langle\cdot\rangle$) over the random variables $\phi_n$ and
$\chi_n$ so that $L$ does not depend on a particular realization. Additionally, we use the
identity $\|\mathbf{J}^n\mathbf{h}_0\|^2 = \mathbf{h}_0^T(\mathbf{J}^{nT}\mathbf{J}^n)\mathbf{h}_0$
to simplify (22). The resultant definition of the Lyapunov exponent used herein is

$$L = \lim_{n\to\infty}\frac{1}{4n\tau}\ln\left(\frac{\mathbf{h}_0^T\langle\mathbf{K}\rangle\mathbf{h}_0}{\mathbf{h}_0^T\mathbf{h}_0}\right), \tag{23}$$

where $\mathbf{K} = \mathbf{J}^{nT}\mathbf{J}^n$. Note, for convenience, we have taken
$\langle\mathbf{K}\rangle$ rather than $\langle\ln\mathbf{K}\rangle$. The ramifications of this
subtle difference stem from the fundamental differences between additive and multiplicative
random walks as detailed by Redner [10]; but it is not a primary concern of the present study.


6.1.2  Lyapunov  exponent:  two  stage  random  wave  model 

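We now want to explicitly calculate the Lyapunov exponent defined in (23) for the two
stage random wave model given by (19-20). During the $n$th step of the random walk,
$\mathbf{J}_n = \mathbf{J}_{II}(\chi_n)\mathbf{J}_I(\phi_n)$, where $\mathbf{J}_I$ and
$\mathbf{J}_{II}$ describe advection in the $x$ direction (first stage) and $y$ direction (second
stage), respectively. From (19-20),

$$\mathbf{J}_I(\phi_n) = \begin{bmatrix}1 & \sin(y_n + \phi_n)\tau\\ 0 & 1\end{bmatrix} \quad\text{and}\quad \mathbf{J}_{II}(\chi_n) = \begin{bmatrix}1 & 0\\ -\sin(x_{n+1} + \chi_n)\tau & 1\end{bmatrix}. \tag{24}$$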
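
The effect of averaging over the random phases can be checked for a single step of the map. The sketch below replaces the ensemble average by an average over uniform grids of the two phases, which reproduces the averages of sines and squared sines exactly; the starting point `(x0, y0)`, the grid size `m`, and the function name are hypothetical choices.

```python
import numpy as np

def one_step_average(tau, x0=0.3, y0=1.1, m=16):
    """Average J^T J over one step of the map, using uniform grids of the two phases."""
    phases = 2.0 * np.pi * np.arange(m) / m
    total = np.zeros((2, 2))
    for phi in phases:
        x1 = x0 - np.cos(y0 + phi) * tau                          # position after stage 1
        j_first = np.array([[1.0, np.sin(y0 + phi) * tau], [0.0, 1.0]])
        for chi in phases:
            j_second = np.array([[1.0, 0.0], [-np.sin(x1 + chi) * tau, 1.0]])
            j_step = j_second @ j_first                           # Jacobian of the full step
            total += j_step.T @ j_step
    return total / m**2

averaged = one_step_average(1.5)
```

The averaged matrix is diagonal, with entries $1 + \tau^2/2$ and $1 + \tau^2/2 + \tau^4/4$, independent of the starting point. This is why only two numbers need to be tracked from step to step.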
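The independence of each step allows the ensemble average of $\mathbf{K}$ after $n$ steps to be
written as

$$\langle\mathbf{K}\rangle = \langle\mathbf{J}_I^T(\phi_1)\langle\mathbf{J}_{II}^T(\chi_1)\cdots\langle\mathbf{J}_I^T(\phi_n)\langle\mathbf{J}_{II}^T(\chi_n)\mathbf{J}_{II}(\chi_n)\rangle\mathbf{J}_I(\phi_n)\rangle\cdots\mathbf{J}_{II}(\chi_1)\rangle\mathbf{J}_I(\phi_1)\rangle. \tag{25}$$

Introducing the diagonal matrix $\boldsymbol{\Gamma} = \begin{bmatrix}a & 0\\ 0 & b\end{bmatrix}$,
the innermost ensemble average of $\langle\mathbf{K}\rangle$ can be written as
$\langle\mathbf{J}_{II}^T(\chi_n)\,\boldsymbol{\Gamma}\,\mathbf{J}_{II}(\chi_n)\rangle$. Upon
calculating several sequential ensemble averages, the recursion relation for
$\mathbf{a} = (a, b)$ becomes apparent. After $n$ steps, we find

$$\begin{bmatrix}a_{n+1}\\ b_{n+1}\end{bmatrix} = \begin{bmatrix}1 & \tau^2/2\\ \tau^2/2 & 1 + \tau^4/4\end{bmatrix}\begin{bmatrix}a_n\\ b_n\end{bmatrix}, \tag{26}$$

where $a_0 = b_0 = 1$. With this, the general form of the Lyapunov exponent (23) reduces to

$$L = \lim_{n\to\infty}\frac{1}{4n\tau}\ln\left(\frac{\begin{bmatrix}h_1 & h_2\end{bmatrix}\begin{bmatrix}a_n & 0\\ 0 & b_n\end{bmatrix}\begin{bmatrix}h_1\\ h_2\end{bmatrix}}{\mathbf{h}_0^T\mathbf{h}_0}\right). \tag{27}$$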


In  order  to  represent  an  in  terms  of  Go  we  need  to  solve  the  corresponding  eigenproblem, 
Mv  =  Xv,  with  M  as  given  in  (26).  Due  to  space  limitations,  we  do  not  provide  the  details 
of  this  calculation.  The  main  result  is  an  =  RAnRTa0,  where  R  is  the  rotation  matrix, 

R=  f  vt  VL 1 ,  and  An  =  f A+  ®n  1 .  After  performing  the  algebra,  we  find 

L  vT  v9  J  L  0  A  J 


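$$ a_n = \frac{2^{-1-3n}}{16 + \tau^4} \left[ \left(16 + \tau^4 + (\tau^2 - 4)\sqrt{16 + \tau^4}\right)\left(8 + \tau^4 - \tau^2\sqrt{16 + \tau^4}\right)^n + \left(16 + \tau^4 - (\tau^2 - 4)\sqrt{16 + \tau^4}\right)\left(8 + \tau^4 + \tau^2\sqrt{16 + \tau^4}\right)^n \right], \tag{28} $$

$$ b_n = 2^{-1-3n} \left[ \frac{\left(4 + \tau^2 + \sqrt{16 + \tau^4}\right)\left(8 + \tau^4 + \tau^2\sqrt{16 + \tau^4}\right)^n}{\sqrt{16 + \tau^4}} - \frac{4\left(\tau^2 - 4 + \sqrt{16 + \tau^4}\right)\left(8 + \tau^4 - \tau^2\sqrt{16 + \tau^4}\right)^n}{16 + \tau^4 + \tau^2\sqrt{16 + \tau^4}} \right]. \tag{29} $$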
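As a quick consistency check, the closed forms (28) and (29) can be compared with a direct iteration of the recursion (26). The sketch below is ours, not part of the original calculation; the function names `recursion` and `closed_form` are hypothetical, and the only inputs assumed are the matrix in (26) and $a_0 = b_0 = 1$.

```python
import math

def recursion(tau, n):
    """Iterate the recursion (26) n times, starting from a_0 = b_0 = 1."""
    a, b = 1.0, 1.0
    for _ in range(n):
        # Both right-hand sides use the old (a, b): tuple assignment evaluates them first.
        a, b = a + 0.5 * tau**2 * b, 0.5 * tau**2 * a + (1.0 + 0.25 * tau**4) * b
    return a, b

def closed_form(tau, n):
    """Evaluate the closed forms (28) and (29) for a_n and b_n."""
    s = math.sqrt(16.0 + tau**4)
    plus = (8.0 + tau**4 + tau**2 * s) ** n    # growing mode, times 8^n
    minus = (8.0 + tau**4 - tau**2 * s) ** n   # decaying mode, times 8^n
    pref = 2.0 ** (-1 - 3 * n)
    a = pref / (16.0 + tau**4) * (
        (16.0 + tau**4 + (tau**2 - 4.0) * s) * minus
        + (16.0 + tau**4 - (tau**2 - 4.0) * s) * plus
    )
    b = pref * (
        (4.0 + tau**2 + s) * plus / s
        - 4.0 * (tau**2 - 4.0 + s) * minus / (16.0 + tau**4 + tau**2 * s)
    )
    return a, b

if __name__ == "__main__":
    for n in range(4):
        print(n, recursion(1.0, n), closed_form(1.0, n))
```

For $n = 1$ both routes give $a_1 = 1 + \tau^2/2$ and $b_1 = 1 + \tau^2/2 + \tau^4/4$.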


Notice that $a_n$ and $b_n$ are not equal; therefore, stretching is anisotropic in this flow field. In fact,
looking at the simple case of $n = 1$, $a_1 = 1 + \tau^2/2$ and $b_1 = 1 + \tau^2/2 + \tau^4/4$; we see that more
stretching occurs in the y direction than in the x direction. However, at long times,
taking the limit as $n \to \infty$, we find that the stretching does become isotropic, as shown in
the following. We rewrite (27) as

$$ L = \lim_{n \to \infty} \frac{1}{4 n \tau} \ln\left( h_1^2 \frac{a_n}{b_n} + h_2^2 \right) + \lim_{n \to \infty} \frac{1}{4 n \tau} \ln(b_n). \tag{30} $$

Figure 3: Ratio of the two diagonal terms in the stretching matrix as a function of steps n in the random map.

Figure 4: Lyapunov exponent L as a function of turbulent decorrelation time $\tau$.

As $n \to \infty$, $a_n/b_n \to \left(-\tau^2 + \sqrt{16 + \tau^4}\right)/4$. Figure 3 shows $a_n/b_n$ as a function of $n$ for a family of
three $\tau$ values. We see that for any given $\tau$, $a_n/b_n$ asymptotes to a constant value. Therefore,
in the limit as $n \to \infty$, the first term in (30) goes to zero. At large $n$, $b_n$ remains as the only
contribution to $L$ and thus stretching becomes isotropic. From figure 3, one can determine
how quickly the flow becomes isotropic for a given $\tau$. We see that for $\tau = 3$, stretching
becomes isotropic after the first step.
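The approach of $a_n/b_n$ to a constant can be reproduced numerically. The following is a minimal sketch, assuming only the recursion $a_{n+1} = a_n + (\tau^2/2) b_n$, $b_{n+1} = (\tau^2/2) a_n + (1 + \tau^4/4) b_n$ with $a_0 = b_0 = 1$; the limiting value used for comparison is the component ratio of the dominant eigenvector of that recursion, and the function names are hypothetical.

```python
import math

def ratio_sequence(tau, n_steps):
    """Return [a_0/b_0, a_1/b_1, ..., a_n/b_n] from the two-term recursion."""
    a, b = 1.0, 1.0
    ratios = [a / b]
    for _ in range(n_steps):
        a, b = a + 0.5 * tau**2 * b, 0.5 * tau**2 * a + (1.0 + 0.25 * tau**4) * b
        ratios.append(a / b)
    return ratios

def limiting_ratio(tau):
    """Component ratio of the dominant eigenvector of the recursion matrix."""
    return (math.sqrt(16.0 + tau**4) - tau**2) / 4.0

if __name__ == "__main__":
    for tau in (0.5, 1.0, 3.0):
        r = ratio_sequence(tau, 10)
        print(tau, [round(v, 4) for v in r[:4]], round(limiting_ratio(tau), 4))
```

For $\tau = 3$ the ratio after a single step already lies within about $1.4 \times 10^{-3}$ of its limit, in line with the remark that stretching becomes isotropic after the first step.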

With further manipulation of (27) we obtain the functional relation between $L$ and $\tau$,

$$ L = \frac{1}{4\tau} \ln\left[ \left(1 + \frac{\tau^4}{8}\right) + \frac{\tau^2}{8}\sqrt{\tau^4 + 16} \right]. \tag{31} $$

Figure 4 shows the graphical representation of (31) compared with that obtained for the
renovating random wave model of (21). Maximum stretching in the two stage random wave
model occurs at $\tau = 3.64$ in contrast to $\tau = 3.94$ for the one stage random model. In general,
the two stage random wave model generates more stretching of the fluid elements.
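The location of the maximum of (31) is easy to confirm numerically. The sketch below evaluates (31) and locates its maximum with a golden-section search; the search interval and tolerance are our own choices, and the function names are hypothetical. As a side check it uses the identity $1 + 2u^2 + 2u\sqrt{1 + u^2} = (u + \sqrt{1 + u^2})^2$ with $u = \tau^2/4$, which lets (31) be written as $L = \sinh^{-1}(\tau^2/4)/(2\tau)$.

```python
import math

def lyapunov(tau):
    """Lyapunov exponent of the two stage random wave model, equation (31)."""
    arg = 1.0 + tau**4 / 8.0 + tau**2 / 8.0 * math.sqrt(tau**4 + 16.0)
    return math.log(arg) / (4.0 * tau)

def argmax_lyapunov(lo=0.1, hi=10.0, tol=1e-8):
    """Golden-section search for the maximum of L on [lo, hi].

    Assumes L has a single interior maximum on the interval.
    """
    g = (math.sqrt(5.0) - 1.0) / 2.0
    c, d = hi - g * (hi - lo), lo + g * (hi - lo)
    while hi - lo > tol:
        if lyapunov(c) > lyapunov(d):
            hi = d          # maximum lies in [lo, d]
        else:
            lo = c          # maximum lies in [c, hi]
        c, d = hi - g * (hi - lo), lo + g * (hi - lo)
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    tau_max = argmax_lyapunov()
    print(tau_max, lyapunov(tau_max))
```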


6.2  Diffusivity  of  fluid  particles 

We follow Einstein's theory of Brownian movement [11] to determine the diffusivity associated
with the movement of fluid particles in the random wave model of (19-20). For a random
walk process in two dimensions, Einstein showed that

$$ \overline{\Delta_n^2} = 4 D_{\mathrm{eff}}\, t, \tag{32} $$

where $\Delta$ is the particle displacement, an overline denotes an average performed over $n$ steps
in the random walk, and $D_{\mathrm{eff}}$ is the effective diffusion coefficient that appears in the scalar
diffusion equation. Since each of the steps in the random walk is independent, $\overline{\Delta_n^2} = n \overline{\Delta^2} = \overline{\Delta^2}\, t/\tau$,
where $t$ is the total time and $\tau$ represents the decorrelation time of the random walk. In
our particular case, particle displacements during the nth step can be written as $r_n^2 = r_{I,n}^2 + r_{II,n}^2$,
where $r_I$, $r_{II}$ are the displacements during the x, y stages of advection, respectively. Without
loss of generality, we can make the coordinate translation $(x_n, y_n) \to (0, 0)$ before each nth
step. Applying this simplification to (19-20) gives

$$ r_n^2 = \tau^2 \cos^2(\phi_n) + \tau^2 \cos^2(-\cos(\phi_n)\tau + \chi_n). \tag{33} $$

Because we want the calculation to be independent of the random phase angles associated
with a particular realization, we ensemble average over both $\phi$ and $\chi$ to yield $\langle r^2 \rangle_{\phi,\chi} = \tau^2$.
Note, in analogy to (32), $\overline{\Delta^2} = \langle r^2 \rangle_{\phi,\chi}$. Taking the average of $\langle r^2 \rangle_{\phi,\chi}$ over $n$ total steps in the
random  walk  and  using  (32)  gives  the  resultant  particle  displacement  diffusivity  of  Deff  =  f  • 
It is interesting to note that this diffusivity is identical to that of the renovating random
wave model (21) where advection is performed in a single step. At this point, there is some
confidence in our understanding of the model flow field; we, therefore, proceed to study the
behavior of the scalar dispersion equation (10) under the two limiting parameter conditions
of $h = 0$ and $h \to \infty$.
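The ensemble average of the squared step displacement can be checked with a few lines of code. The sketch below is illustrative only: it assumes the phases $\phi$ and $\chi$ are uniformly distributed on $[0, 2\pi)$ and replaces the ensemble average by an average over a uniform grid of phases; the function name and grid size are hypothetical.

```python
import math

def mean_square_step(tau, m=64):
    """Average r^2 = tau^2 cos^2(phi) + tau^2 cos^2(-cos(phi) tau + chi)
    over a uniform m x m grid of the phases (phi, chi)."""
    total = 0.0
    for i in range(m):
        phi = 2.0 * math.pi * i / m
        x_disp = -math.cos(phi) * tau            # first stage, starting from (0, 0)
        for j in range(m):
            chi = 2.0 * math.pi * j / m
            y_disp = math.cos(x_disp + chi) * tau  # second stage uses the updated x
            total += x_disp**2 + y_disp**2
    return total / m**2

if __name__ == "__main__":
    for tau in (0.5, 1.0, 2.5):
        print(tau, mean_square_step(tau), tau**2)
```

Each stage contributes $\tau^2/2$ on average, so the mean squared displacement per step is $\tau^2$ for any $\tau$.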

7  Zero  Dispersion  Limit 

In the case $h = 0$, the scalar dispersion equation (10) along with (18) reduces to

$$ A_t - \cos(y + \phi) A_x - \frac{i}{2}\sin(y + \phi) A = 0, \quad 2n\tau < t < (2n+1)\tau, \tag{34} $$

$$ A_t + \cos(x + \chi) A_y - \frac{i}{2}\sin(x + \chi) A = 0, \quad (2n+1)\tau < t < (2n+2)\tau. \tag{35} $$

We solve the above set of equations using the method of characteristics. In the first stage of
the random wave model, we define new variables $\tilde{x} = x + c_I t$, $\tilde{t} = t$, $\tilde{y} = y$; while in the second
stage, we define $\tilde{y} = y - c_{II} t$, $\tilde{t} = t$, $\tilde{x} = x$, where $c_I = \cos(y + \phi)$ and $c_{II} = \cos(x + \chi)$. This
effectively removes the advective terms from (34-35), simplifying the problem to two uncoupled,
first order ordinary differential equations, one at each advection stage. The solutions valid
in the first and second stages are

$$ A_I = A_0\, e^{\frac{i}{2}\sin(y + \phi)t} \quad \text{and} \quad A = A_I\, e^{\frac{i}{2}\sin(x + \chi)t}, \tag{36} $$

respectively, where $A_0$ represents the initial condition at the beginning of the nth step. Recall,
at $n = 0$, $A(x, y, t = 0) = 1$. Using the fact that $\tilde{x}$ and $\tilde{y}$ remain constant along characteristic
curves, the solution can be written in terms of an iterated map

$$ 2n\tau < t < (2n+1)\tau: \quad A^I_{n+1} = A_n\, e^{\frac{i}{2}\sin(y_n + \phi_n)\tau}, \quad x_{n+1} = x_n - \cos(y_n + \phi_n)\tau, \tag{37} $$

$$ (2n+1)\tau < t < (2n+2)\tau: \quad A_{n+1} = A^I_{n+1}\, e^{\frac{i}{2}\sin(x_{n+1} + \chi_n)\tau}, \quad y_{n+1} = y_n + \cos(x_{n+1} + \chi_n)\tau. \tag{38} $$

Figure 5: Snapshots of the spatial structure of $\Re(A)$ for $\tau = 1$, $h = 0$. Recall that at
$n = 0$, $\Re(A) = 1$.

Figure 5 displays four snapshots of $\Re(A)$ as computed from the map above for the case of
$\tau = 1$. The resolution is 1028x1028. Later, we will compare these pictures to the structure
of $\Re(A)$ for $h \neq 0$. From (37) and (38), it is apparent that no mechanism exists to instigate
changes in $|A|$; therefore, $|A|$ remains constant at its initial value of unity. This agrees with
the previous results of section 5. However, the phase of $A$, denoted by $\theta$ as in (11), does
exhibit interesting behavior. The iterated map for $\theta$ follows directly from (37-38) as

$$ \theta_{n+1} = \theta_n + \sin(y_n + \phi_n)\frac{\tau}{2} + \sin(x_{n+1} + \chi_n)\frac{\tau}{2}, \tag{39} $$

where $x_{n+1}$ and $y_{n+1}$ are given as in (37) and (38). We observe that for $h = 0$, $\theta$ undergoes
a random walk process and therefore has an associated diffusivity $D_\theta$ (not to be confused
with the diffusivity of particle trajectories). To calculate $D_\theta$, we follow the procedure in
section 6.2. The ensemble averaged variance of $\theta$ is defined as $\langle \theta'^2 \rangle_{\phi,\chi} = \langle (\theta_{n+1} - \theta_n)^2 \rangle$.
Substituting in (39) and performing the average yields $\langle \theta'^2 \rangle = \tau^2/4$. Making the analogy
with Einstein's theory in (32), we have $\overline{\Delta^2} = \langle \theta'^2 \rangle$. Taking the average of $\overline{\Delta^2}$ over $n$ total
steps in the random walk and using (32)† gives the resultant phase diffusivity $D_\theta = \tau/16$.
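The zero dispersion map is simple enough to iterate directly. The sketch below follows one fluid particle through the two stage map for $x$, $y$, $A$ and $\theta$, and separately evaluates the one-step phase variance on a uniform grid of phases. It is an illustration under stated assumptions rather than the code used for the figures: the phases are taken to be uniform on $[0, 2\pi)$, and the names `iterate_map` and `phase_step_variance` are hypothetical.

```python
import cmath
import math
import random

def iterate_map(x, y, tau, n_steps, rng):
    """Advance one particle through n_steps two-stage steps with h = 0.

    Returns the final position, the amplitude A and the accumulated phase theta.
    """
    A, theta = 1.0 + 0.0j, 0.0
    for _ in range(n_steps):
        phi = rng.uniform(0.0, 2.0 * math.pi)
        chi = rng.uniform(0.0, 2.0 * math.pi)
        s1 = math.sin(y + phi)
        A *= cmath.exp(0.5j * s1 * tau)      # first stage: phase factor, then x advection
        x -= math.cos(y + phi) * tau
        s2 = math.sin(x + chi)               # second stage uses the updated x
        A *= cmath.exp(0.5j * s2 * tau)
        y += math.cos(x + chi) * tau
        theta += 0.5 * tau * (s1 + s2)       # phase increment for the full step
    return x, y, A, theta

def phase_step_variance(tau, m=64):
    """Grid average of (theta_{n+1} - theta_n)^2 over (phi, chi), starting at (0, 0)."""
    total = 0.0
    for i in range(m):
        phi = 2.0 * math.pi * i / m
        x1 = -math.cos(phi) * tau
        for j in range(m):
            chi = 2.0 * math.pi * j / m
            dtheta = 0.5 * tau * (math.sin(phi) + math.sin(x1 + chi))
            total += dtheta**2
    return total / m**2

if __name__ == "__main__":
    x, y, A, theta = iterate_map(0.3, 1.1, 1.0, 25, random.Random(0))
    print(abs(A), abs(A - cmath.exp(1j * theta)), phase_step_variance(1.0))
```

The modulus of $A$ stays at unity and $A = e^{i\theta}$ along the trajectory, while the one-step phase variance equals $\tau^2/4$.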


Figure 6: Phase of $A$ for the case of $h = 0$, $\tau = 1$ after 25 random advection steps.

This is an interesting result in that $\theta$, for zero dispersivity, undergoes a
normal diffusion process analogous to that of a real passive scalar. The structure of $\theta$ after
25 iterations of the map in (39), for $\tau = 1$, is shown in figure 6. Note, in order to obtain a
continuous field, results are plotted as $\theta/2\pi$. The initial condition at $t = 0$ is $\theta = 0$ and the
resolution is 1028 x 1028. This picture looks surprisingly similar to the stirring of a passive
scalar as presented in [8] (figure 1 of that paper), for a random velocity field nearly equivalent
to that of (18).

8  Strong  Dispersion  Limit 

We use a simple multiscale analysis to investigate the large $h$ behavior of (10) by introducing
a small parameter $\epsilon$ and a rescaled time $t'$ such that $h = O(1/\epsilon)$ and $t = \epsilon t'$. Applying these
scalings to (10) and multiplying by $\epsilon$ gives

$$ A_{t'} + \epsilon J(\psi, A) + \frac{i\epsilon}{2}\zeta A = \frac{i}{2}\nabla^2 A. \tag{40} $$

† In the case of $\theta$, we have a one dimensional random walk given by $\overline{\Delta_n^2} = 2 D_\theta t$.

We assume that the solution can be written as a power series expansion in $\epsilon$, $A = A_0 +
\epsilon A_1 + \epsilon^2 A_2 + \ldots$. The $O(1)$ equation is obtained by substituting this expansion into (40)
and taking the limit as $\epsilon \to 0$,

$$ O(1): \quad A_{0t'} = \frac{i}{2}\nabla^2 A_0, \tag{41} $$

with the initial condition $A_0(t = 0) = 1$. The solution $A_0 = 1$ trivially satisfies this initial
value problem. The $O(\epsilon)$ equation is obtained by substituting the $O(1)$ solution into (40),
dividing by $\epsilon$, and taking the limit as $\epsilon \to 0$,

$$ O(\epsilon): \quad A_{1t'} - \frac{i}{2}\nabla^2 A_1 = -J(\psi, A_0) - \frac{i}{2}\zeta A_0 = -\frac{i}{2}\zeta, \tag{42} $$

with the initial condition $A_1(t = 0) = 0$. Note, $J(\psi, A_0) = 0$. Because of the explicit form
of $\zeta$, see (18), we need to consider the two stages of the advection process separately. This
complicates the problem in that the initial condition for each advection stage depends on the
final state at the end of the previous advection stage. We start by considering the first time
interval $0 < t < \tau$, during which $\zeta = -\sin(y + \phi)$ and (42) reduces to

$$ A_{1t'} - \frac{i}{2}\nabla^2 A_1 = \frac{i}{2}\sin(y + \phi), \tag{43} $$

with $A_1(t = 0) = 0$. We require both the homogeneous and particular parts of the solution to
vanish at $t = 0$. Therefore, the homogeneous part of the solution is simply 0. We assume a
particular solution of the form $A_1 = a(t')\sin(y + \phi)$. Substituting this into (43), we obtain
an ordinary differential equation for $a(t')$, the solution of which is $a(t') = 1 - e^{-it'/2}$. Thus,
in the first time interval, with $\epsilon = 1/h$ and $t' = ht$,

$$ A = 1 + \frac{1}{h}\left(1 - e^{-iht/2}\right)\sin(y + \phi) + O(\epsilon^2), \quad 0 < t < \tau. \tag{44} $$

In the subsequent time interval $\tau < t < 2\tau$, $\zeta = -\sin(x + \chi)$ and the initial condition is
$A_1(t = \tau) = \left(1 - e^{-ih\tau/2}\right)\sin(y + \phi)$. Following the same procedure used above, we find

$$ A = 1 + \frac{1}{h}\left[ e^{-ih(t - \tau)/2}\left(1 - e^{-ih\tau/2}\right)\sin(y + \phi) + \left(1 - e^{-ih(t - \tau)/2}\right)\sin(x + \chi) \right] + O(\epsilon^2), \quad \tau < t < 2\tau. \tag{45} $$
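The amplitude equation behind the first-interval solution can be verified numerically. Substituting $A_1 = a(t')\sin(y + \phi)$ into the $O(\epsilon)$ problem gives $a' + \frac{i}{2}a = \frac{i}{2}$ with $a(0) = 0$. The sketch below integrates this ordinary differential equation with a classical fourth order Runge-Kutta scheme and compares the result with $a(t') = 1 - e^{-it'/2}$. The integrator and the function names are our own illustrative choices, not part of the original analysis.

```python
import cmath

def rhs(a):
    """da/dt' for the O(eps) amplitude: a' = (i/2)(1 - a)."""
    return 0.5j * (1.0 - a)

def integrate(t_end, n=2000):
    """Integrate a' = rhs(a) from a(0) = 0 to t' = t_end with n RK4 steps."""
    a = 0.0 + 0.0j
    dt = t_end / n
    for _ in range(n):
        k1 = rhs(a)
        k2 = rhs(a + 0.5 * dt * k1)
        k3 = rhs(a + 0.5 * dt * k2)
        k4 = rhs(a + dt * k3)
        a += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return a

def exact(t):
    """Closed-form amplitude a(t') = 1 - exp(-i t'/2)."""
    return 1.0 - cmath.exp(-0.5j * t)

if __name__ == "__main__":
    for t in (1.0, 5.0, 2.0 * cmath.pi):
        print(t, integrate(t), exact(t))
```

The amplitude is purely oscillatory with $|a| \le 2$, so the $O(\epsilon)$ correction to $A$ never exceeds $2/h$ in magnitude during the first interval.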


One  thing  to  notice  in  the  strong  dispersion  limit  is  that  spatial  structure  in  A  develops  very 
slowly,  in  stark  contrast  to  the  case  of  h  =  0  where,  after  only  12  iterations,  the  structure  of 
A  has  become  highly  stretched  and  contorted  (see  figure  5).  We  will  return  to  these  results 
later  in  section  10  to  ascertain  how  the  energy  contained  in  the  large  scales  of  A  varies  as  a 
function of $h$. We now consider the regime $h \neq 0$.




9 Numerical Solution for h ≠ 0

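Both the spatial domain and the streamfunction are $2\pi$ periodic; therefore, we seek a general
solution to (10) in the form of a Fourier series expansion

$$ A = \sum_{k,\ell=-\infty}^{\infty} a_{k,\ell}(t)\, e^{ikx + i\ell y}, \qquad a_{k,\ell}(0) = \begin{cases} 1 & \text{if } k = \ell = 0, \\ 0 & \text{otherwise,} \end{cases} \tag{46} $$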


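where $\mathbf{k} = (k, \ell)$ describes the wavenumber vector and $a_{k,\ell}(t)$ are the corresponding Fourier
coefficients. Substituting (46) into (10) yields

$$ \sum_{k,\ell=-\infty}^{\infty} \left[ \dot{a}_{k,\ell} + i\ell\psi_x a_{k,\ell} - ik\psi_y a_{k,\ell} + \frac{i}{2}\nabla^2\psi\, a_{k,\ell} \right] e^{ikx + i\ell y} = -\frac{ih}{2} \sum_{k,\ell=-\infty}^{\infty} (k^2 + \ell^2)\, a_{k,\ell}\, e^{ikx + i\ell y}. \tag{47} $$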

In order to eliminate the summations, we utilize the fact that the Fourier modes are
orthogonal. To exploit this, we multiply (47) by $e^{-iqy - ipx}$ and integrate over $y$ from 0 to $2\pi$ and
$x$ from 0 to $2\pi$. Integrations are performed in detail in the Appendix. It is worth noting that if
we had not chosen a two stage random wave model for the velocity field, but rather used a
single stage model such as (21), then it would not be possible to analytically integrate the
terms resulting from (47). Performing the integrations yields two sets of coupled, first order
ordinary differential equations for $a_{k,\ell}(t)$, each valid in one of the two stages of the advection
process

$$ \dot{a}_{k,\ell} + a_{k,\ell+1}\, e^{-i\phi_n}\left(\frac{1 - i2k}{4}\right) - a_{k,\ell-1}\, e^{i\phi_n}\left(\frac{1 + i2k}{4}\right) = -\frac{ih}{2}(k^2 + \ell^2)\, a_{k,\ell}, \tag{48} $$

$$ \dot{a}_{k,\ell} + a_{k+1,\ell}\, e^{-i\chi_n}\left(\frac{1 + i2\ell}{4}\right) - a_{k-1,\ell}\, e^{i\chi_n}\left(\frac{1 - i2\ell}{4}\right) = -\frac{ih}{2}(k^2 + \ell^2)\, a_{k,\ell}, \tag{49} $$

for $n = 0, 1, \ldots$ and $k, \ell = -\infty, \ldots, -1, 0, 1, \ldots, \infty$, where (48) and (49) are valid during
the time intervals $2n\tau < t < (2n+1)\tau$ and $(2n+1)\tau < t < (2n+2)\tau$, respectively. The
apparent coupling between nearest neighbors of $a$ results directly from the fact that the
imposed velocity field contains only one Fourier component, the lowest nonzero wavenumber
component. A convenient aspect of the two stage random wave velocity field is that, in the
first stage, coupling occurs only between $\ell$ wavenumbers; while, in the second stage, coupling
occurs only between $k$ wavenumbers.

We solve (48-49) numerically to obtain the time evolution of the Fourier coefficients $a_{k,\ell}$,
then utilize an inverse fast Fourier transform (FFT) algorithm to perform the resummation in
(46) to obtain $A$. In the numerical solution, $k$, $\ell$ must be truncated at the Nth Fourier mode,
i.e. the summation appearing in (46) occurs over $-N \le k, \ell \le N$. We chose $N$ such that
the amplitudes of the corresponding Fourier modes at $|k|, |\ell| > N$ have decreased below a set
tolerance. In practice, though, we typically over resolve the Fourier domain by a substantial
amount since we favor a grid size of $2^p \times 2^p$, $p = 0, 1, 2, \ldots$.

Figure 7: Snapshots of the spatial structure of $\Re(A)$ for $\tau = 1$, $h = 1$. Recall that at
$t = 0$, $\Re(A) = 1$.

The two truncated systems can each be described by a matrix equation, written in the
general, compact form

$$ \dot{y}(t) = f(t, y(t)). \tag{50} $$

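For example, considering the simple case of $N = 2$, (50) becomes

$$ \frac{d}{dt} \begin{bmatrix} a_{-2} \\ a_{-1} \\ a_0 \\ a_1 \\ a_2 \end{bmatrix} = \begin{bmatrix} c & d & 0 & 0 & 0 \\ b & c & d & 0 & 0 \\ 0 & b & c & d & 0 \\ 0 & 0 & b & c & d \\ 0 & 0 & 0 & b & c \end{bmatrix} \begin{bmatrix} a_{-2} \\ a_{-1} \\ a_0 \\ a_1 \\ a_2 \end{bmatrix}, \tag{51} $$

where $b = \frac{1 + i2k}{4} e^{i\phi_n}$, $c = -\frac{ih}{2}(k^2 + \ell^2)$, $d = -\frac{1 - i2k}{4} e^{-i\phi_n}$, in the first stage; and
$b = \frac{1 - i2\ell}{4} e^{i\chi_n}$, $c = -\frac{ih}{2}(k^2 + \ell^2)$, $d = -\frac{1 + i2\ell}{4} e^{-i\chi_n}$, in the second stage. We use
the tridiagonal structure of (51) to our advantage in selecting a discretization method. The
second order, fully implicit Adams-Moulton method is used in the present study. Applying
this discretization method to (50) gives

$$ \frac{y_{m+1} - y_m}{\delta t} = \frac{f_{m+1} + f_m}{2}. \tag{52} $$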


The numerical method conveniently preserves the tridiagonal structure of the original
truncated system. At each time step, we solve two uncoupled, linear, tridiagonal systems
for $a_{k,\ell}(t)$ (one during each advection stage) with an efficient tridiagonal system solver. A
numerical C code was written to compute the time evolution of $a_{k,\ell}$ and perform the subsequent
inverse FFT to obtain the spatio-temporal structure of $A$. Since C does not have
built-in capability for handling complex numbers, special functions were written to deal with
complex number operations.
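To make the time stepping concrete, the sketch below advances the first stage system for a single $k$ by one step of the second order implicit scheme, using a standard tridiagonal (Thomas) solve. It is not the C code described above: it is a minimal Python illustration (Python handles complex arithmetic natively), the function names `thomas` and `stage_one_step` are hypothetical, and the coupling is taken as $\dot{a}_\ell = b\, a_{\ell-1} + c_\ell\, a_\ell + d\, a_{\ell+1}$ with $b = \frac{1 + i2k}{4}e^{i\phi}$, $c_\ell = -\frac{ih}{2}(k^2 + \ell^2)$ and $d = -\frac{1 - i2k}{4}e^{-i\phi}$.

```python
import cmath

def thomas(sub, diag, sup, rhs):
    """Solve a tridiagonal system without pivoting.

    Row i reads sub[i]*x[i-1] + diag[i]*x[i] + sup[i]*x[i+1] = rhs[i];
    sub[0] and sup[-1] are ignored.
    """
    n = len(diag)
    c = [0j] * n
    d = [0j] * n
    c[0] = sup[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - sub[i] * c[i - 1]
        c[i] = sup[i] / denom if i < n - 1 else 0j
        d[i] = (rhs[i] - sub[i] * d[i - 1]) / denom
    x = [0j] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x

def stage_one_step(a, k, h, phi, dt):
    """One implicit second order step for the first stage at fixed k.

    a holds a_{k,l} for l = -N..N; solves (I - dt/2 M) a_new = (I + dt/2 M) a.
    """
    n = len(a)
    N = (n - 1) // 2
    b = (1 + 2j * k) / 4 * cmath.exp(1j * phi)      # multiplies a_{k,l-1}
    d = -(1 - 2j * k) / 4 * cmath.exp(-1j * phi)    # multiplies a_{k,l+1}
    c = [-0.5j * h * (k * k + l * l) for l in range(-N, N + 1)]
    rhs = []
    for i in range(n):
        v = (1 + 0.5 * dt * c[i]) * a[i]
        if i > 0:
            v += 0.5 * dt * b * a[i - 1]
        if i < n - 1:
            v += 0.5 * dt * d * a[i + 1]
        rhs.append(v)
    sub = [-0.5 * dt * b] * n
    sup = [-0.5 * dt * d] * n
    diag = [1 - 0.5 * dt * ci for ci in c]
    return thomas(sub, diag, sup, rhs)

if __name__ == "__main__":
    a = [0j] * 9
    a[4] = 1 + 0j                     # only the l = 0 mode is excited initially
    for _ in range(200):
        a = stage_one_step(a, k=1, h=1.0, phi=0.7, dt=0.005)
    print(sum(abs(v) ** 2 for v in a))
```

With these coefficients the truncated matrix is anti-Hermitian ($d = -b^*$ and $c_\ell$ purely imaginary), so each step is a unitary update and $\sum_\ell |a_{k,\ell}|^2$ is conserved to rounding error, which gives a convenient sanity check on an implementation.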

There are some additional comments worth mentioning regarding the present numerical
scheme. At each time step we verify that the code conserves $|A|^2$ over the spatial domain
(refer to section 5) by tracking $a_{0,0}$. The deviation of $a_{0,0}$ from the expected value of unity
is never greater than $1 \times 10^{-10}$. Additional calculations regarding the stability and accuracy
of the present numerical scheme were performed; however, due to space limitations, we do
not provide those details. The analyses stem from a comparison between the solution to
the discretized equation and the analytical solution obtained in section 8. We found the
numerical method to be unconditionally stable. Furthermore, we found that the
time step required to achieve a specified accuracy depended on $h$ and the magnitude of the
squared wavenumber $k^2$. For example, to achieve 98% accuracy in the highest wavenumber component
of the numerical solution for $h = 1$, a time step no larger than 0.005 is required.

Figure 7 displays four snapshots of the spatial structure of $\Re(A)$ for the case of $h = 1$,
$\tau = 1$. The resolution in each picture is 1028x1028. Figure 8 displays the numerical results
from a comparison study between four different parameter values of $h = 0$, 0.1, 1, and 10.
Only the real part of $A$ is shown, although the imaginary part exhibits similar structure. The
snapshots are taken at $n = 10$ with $\tau = 1$. All computations utilized the same random data
set for $\phi$ and $\chi$.


10  Energy  in  Large  Scales 

We now take a look at how the energy in the large scales decays in time as a function of $h$.
Recall from section 5 that $\langle AA^* \rangle$ represents a conserved quantity where $\langle \cdot \rangle$ denotes a spatial
average and the superscript * denotes the complex conjugate. Therefore, we will define the
energy associated with the complex scalar $A$ as

$$ e = \langle A \rangle \langle A^* \rangle. \tag{53} $$

Figure 8: Comparison of the spatial structure of $\Re(A)$ for four different $h$ values of 0, 0.1, 1,
10. All snapshots were taken at $n = 10$, $\tau = 1$ and utilize the same random data set for $\phi$, $\chi$.

Figure 9: Decay of energy in the large scales of $A$ for $\tau = 1$.

Figure 10: Expanded view of figure 9 in the region near $t = 0$.

Figure 9 shows the results of $e$ for three different cases of $h = 0$, 0.01, 1 as computed from
the numerical code presented in section 9. The results are for one particular realization only.
The exponential prediction shown stems from the hypothesis that the spatial average in the
definition of (53) can be replaced by an ensemble average, i.e. $\langle A \rangle = E[e^{i\theta}]$, where $E[\cdot]$
denotes the expectation of the random process. For the case of $h = 0$, we showed in section 7
that $\theta$ obeys a random walk; therefore, the probability distribution of $\theta$ is Gaussian. According to the
prediction,

$$ \langle A \rangle = \int_{-\infty}^{\infty} \frac{e^{-\theta^2/2\sigma^2}}{\sqrt{2\pi\sigma^2}}\, e^{i\theta}\, d\theta = e^{-\sigma^2/2} = e^{-D_\theta t}, \tag{54} $$

where $D_\theta$ is the diffusivity of the phase as calculated in section 7. It is clear from figure 9 that
the prediction (54) fails to describe the actual behavior of $e$.
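The Gaussian average used in the prediction can be checked by direct quadrature. The sketch below evaluates $E[e^{i\theta}]$ for a zero-mean Gaussian phase with variance $\sigma^2 = 2 D_\theta t$ (the one dimensional random walk relation) using the trapezoidal rule; the integration limits, the number of points and the function name are our own illustrative choices.

```python
import cmath
import math

def gaussian_phase_average(sigma2, half_width=12.0, m=4001):
    """Trapezoidal estimate of E[exp(i*theta)] for theta ~ N(0, sigma2).

    Integrates over [-half_width*sigma, +half_width*sigma] with m points.
    """
    sigma = math.sqrt(sigma2)
    lo = -half_width * sigma
    dth = 2.0 * half_width * sigma / (m - 1)
    norm = math.sqrt(2.0 * math.pi * sigma2)
    total = 0j
    for j in range(m):
        th = lo + j * dth
        w = 0.5 if j in (0, m - 1) else 1.0      # trapezoidal end-point weights
        total += w * math.exp(-th * th / (2.0 * sigma2)) / norm * cmath.exp(1j * th)
    return total * dth

if __name__ == "__main__":
    D_theta, t = 1.0 / 16.0, 8.0                 # tau = 1 gives D_theta = 1/16
    print(gaussian_phase_average(2.0 * D_theta * t), math.exp(-D_theta * t))
```

The quadrature reproduces $e^{-\sigma^2/2} = e^{-D_\theta t}$, so the mismatch with the computed $e$ reflects the hypothesis of replacing the spatial average by an ensemble average, not the evaluation of the integral.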

From our analysis of the strong dispersion limit, we know that as $h \to \infty$, $e$ remains
constant at the initial value of 1. This yields an interesting picture of the decay of the large
scale energy as a function of $h$. At $h = 0$, the rate of decay of $e$ is faster than exponential.
As $h$ increases from 0, the rate of decay becomes faster than that for $h = 0$. However, at
some critical value of $h > 1$, the rate of energy decay becomes slower than exponential and
eventually becomes zero, since $e = 1$ for all time as $h \to \infty$.

11  Conclusion  and  Future  Work 

The main focus of the present study has been the attempt to understand some of the fundamental
properties of the passive scalar dispersion equation (10) in the context of a random
wave model (18) for the two dimensional background turbulent velocity field. We have found
that the dispersivity parameter $h$ greatly affects the spatiotemporal structure of the complex
scalar $A$. For $h = 0$, both the real and imaginary parts of $A$ become highly stretched and
contorted even after only 10 iterations of the random wave field. In contrast, for the case of
$h = 1$, the spatial structure of $A$ looks blotchy with little indication of stretching or amplification
of the gradient of $A$. As $h$ increases, the time evolution of the spatial structure of $A$
becomes increasingly slower. Below a critical value $h_c$, energy in the large scales of $A$ decays
faster than exponential; while for cases of $h > h_c$, the energy in the large scales decays slower
than exponential. In fact, as $h \to \infty$, the energy in the large scales remains constant at a
value of unity. A useful extension of the present study would be to quantify the structure of
$A$, visualized herein, using probability density functions. Another natural direction for future
work would be to determine whether $A$ exhibits a Batchelor scale. In other words, is there
a limit to the smallest scales of $A$ achievable in the flow? Additionally, if there is a cascade
of energy from large to small scales, then what are the relevant scalings associated with the
spectra and how do these compare with the case of passive scalar diffusion?

Finally, with regard to near inertial oscillations, from the vertical normal mode decomposition
presented in section 4 along with figure 2, we recognize that large $h$ corresponds to low
wavenumber vertical motions while $h \to 0$ corresponds to high wavenumber vertical motions.
It remains somewhat unclear, though, how one can directly apply the present observations
regarding the variation in the decay of energy of $A$ with $h$ toward further understanding the
behavior of the NIO velocity field in the oceanic mixed layer. Recall that in order to relate
the NIO velocity field to the results of $A$ presented here, we first need to compute $A$ using
the superposition of vertical normal modes and then apply the operator $L$. This is left as a
task for future work.

13 Appendix

Here, we detail the integrations resulting from the Fourier series expansion of $A$ as described
in section 9. The corresponding terms in (47) will be referred to sequentially as (I)-(V)
starting from left to right. Each term is multiplied by $e^{-iqy - ipx}$ and integrated over $y$ from 0 to $2\pi$
and $x$ from 0 to $2\pi$. Due to orthogonality of the Fourier modes,

$$ \int_{x=0}^{2\pi} e^{i(k - p)x}\, dx = 2\pi\delta_{kp} \quad \text{and} \quad \int_{y=0}^{2\pi} e^{i(\ell - q)y}\, dy = 2\pi\delta_{\ell q}, \tag{55} $$

where $\delta$ is the Kronecker delta. The two stages of the advection process are considered
separately. We only outline integrations for the first stage; those for the second stage follow in
a similar manner. The first advection stage occurs during time intervals $2n\tau < t < (2n+1)\tau$,
$n = 0, 1, \ldots$ with the streamfunction given by $\psi = \sin(y + \phi)$. Terms in (47), excluding term
(II) which is identically zero, are as follows:

(III)

Note, terms similar to $\cos(y + \phi)\, e^{i(\ell - q)y}$ can be easily integrated by rewriting the trigonometric
part as an exponential, e.g. $\frac{1}{2}\left(e^{i(y + \phi)} + e^{-i(y + \phi)}\right)$.
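As a sketch of how the first stage integrals work out, the four nonzero terms can be evaluated as follows. This is a reconstruction from the Fourier expansion, the orthogonality relations and $\psi = \sin(y + \phi)$, with $h$ denoting the dispersivity parameter; it is not a transcription of the original term-by-term list.

```latex
\begin{align*}
\text{(I)}:\quad & \sum_{k,\ell} \dot{a}_{k,\ell} \int_0^{2\pi}\!\!\int_0^{2\pi}
    e^{i(k-p)x} e^{i(\ell-q)y}\, dx\, dy = 4\pi^2\, \dot{a}_{p,q}, \\
\text{(III)}:\quad & -\sum_{k,\ell} ik\, a_{k,\ell} \int_0^{2\pi}\!\!\int_0^{2\pi}
    \cos(y+\phi)\, e^{i(k-p)x} e^{i(\ell-q)y}\, dx\, dy
    = -2\pi^2\, ip \left[ e^{i\phi} a_{p,q-1} + e^{-i\phi} a_{p,q+1} \right], \\
\text{(IV)}:\quad & -\frac{i}{2} \sum_{k,\ell} a_{k,\ell} \int_0^{2\pi}\!\!\int_0^{2\pi}
    \sin(y+\phi)\, e^{i(k-p)x} e^{i(\ell-q)y}\, dx\, dy
    = -\pi^2 \left[ e^{i\phi} a_{p,q-1} - e^{-i\phi} a_{p,q+1} \right], \\
\text{(V)}:\quad & -\frac{ih}{2} \sum_{k,\ell} (k^2+\ell^2)\, a_{k,\ell}
    \int_0^{2\pi}\!\!\int_0^{2\pi} e^{i(k-p)x} e^{i(\ell-q)y}\, dx\, dy
    = -2\pi^2\, ih\, (p^2+q^2)\, a_{p,q}.
\end{align*}
```

Setting (I) + (III) + (IV) = (V) and dividing by $4\pi^2$ gives $\dot{a}_{p,q} - \frac{1 + i2p}{4} e^{i\phi} a_{p,q-1} + \frac{1 - i2p}{4} e^{-i\phi} a_{p,q+1} = -\frac{ih}{2}(p^2 + q^2)\, a_{p,q}$, which is the first stage system of section 9 with $(k, \ell)$ relabeled as $(p, q)$.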
The temperature-salinity relationship in the mixed layer


Raffaele  Ferrari 


1  Introduction 

In the surface mixed layer (ML) of the ocean there is a remarkable correlation between the
horizontal temperature and salinity gradients. The goals of this project are (1) to investigate
whether these correlations are the result of processes at work within the ML, and (2) to develop
parameterizations of these processes to be used in large-scale ocean models.

Observations  show  abundant  examples  of  horizontal  fronts  with  temperature  and  salinity 
that  oppose  in  their  joint  effect  on  density  on  scales  of  10  m  to  100  km  [1],  [2],  [3].  A  useful 
measure  of  the  degree  of  compensation  is  the  density  ratio  R,  defined  as  the  ratio  of  the 
relative  effect  of  temperature  and  salinity  on  a  density  front.  Stommel  and  Chen  ([4],  [5]) 
computed  density  ratios  from  large  scale  meridional  temperature-salinity  (T  -  S)  gradients  in 
the  range  of  latitudes  between  20°  and  50°  and  concluded  that  R  has  a  mean  close  to  1.7,  even 
though  individual  fronts  can  have  a  density  ratio  markedly  different  from  1.7.  These  results 
are  obtained  from  climatological  data  sets  and  refer  to  density  ratios  on  scales  of  thousands 
of  kilometers  averaged  over  a  number  of  years.  On  smaller  horizontal  scales,  between  20  m 
and  10  km,  Rudnick  and  Ferrari  find  that  the  ML  density  ratio  is  1  and  not  2  [3].  These 
observations  imply  that  the  mean  density  ratio  on  large  scale  is  1.7,  even  though  typical 
thermohaline  gradients  at  small  scale  have  an  instantaneous  density  ratio  of  1. 

One possible explanation of the correlations between thermohaline gradients in the ML is
that atmospheric forcing and entrainment of thermocline waters create and juxtapose water
masses with compensating T-S gradients. However, the ratio of heat to freshwater buoyancy
fluxes is variable in large scale maps and in time series at a point, and there is no evidence
that these fluxes force a small scale frontal density ratio of 1 and a large annual mean density
ratio of 1.7. In this project we propose an alternative interpretation that relies on regulating
mechanisms in the ML to create correlations between temperature and salinity, regardless of
the large scale atmospheric forcing. Furthermore, we show that the processes responsible for
these correlations are not properly represented in large-scale ocean models and we suggest
more appropriate parameterizations.

2  Nonlinear  diffusive  parameterizations  of  eddy  transfer  of 
heat  and  salt  in  the  mixed  layer 

The basic idea of this study is straightforward: temperature and salinity are dynamically
active in the ML because they contribute to density gradients. All processes that depend on
density gradients, and not on temperature and salinity gradients separately, are potentially
capable of creating T-S relations, because they act only on T-S fluctuations that reinforce in
their joint effect on density. In order to test this idea, we first derive parameterizations of the
ML, which use density gradients as the driving field, and then we examine the consequences of
these parameterizations on the distributions of temperature and salinity in numerical models.

Let us consider the dispersion of some tracer of concentration $\theta(x, y, z, t)$ in the ML. We
model the ML as a vigorously mixed shallow layer characterized by a high aspect ratio, i.e.
with a depth $H$ much less than the horizontal scale $L$. The main point here is that there are
two very different time scales: a fast time scale over which the layer is mixed vertically and a
slow time scale associated with horizontal transports. We argue that in order to describe the
lateral dispersion of the tracer on the slow time scale, it is not necessary to resolve the details
of the processes active on the fast time scale. The combined action of small scale
stirring and vertical mixing can be parameterized as a diffusion of the vertically averaged
tracer.

A mathematical model for the horizontal transport of tracers in the ML is formulated by
decomposing $\theta$ into a "mean", denoted by $\bar{\theta}$ and defined as the average over the depth $H$ of the
ML and over a period long compared to the time scale of vertical mixing, and an "eddy",
denoted by $\theta'$ and defined as the departure from that mean. The Reynolds averaged equation
for a conserved tracer is

$$ \bar{\theta}_t + \bar{\mathbf{u}} \cdot \nabla\bar{\theta} = -\nabla \cdot \overline{\mathbf{u}'\theta'} + F. \tag{1} $$

Here $\mathbf{u} = (u, v)$ denotes the horizontal velocity of the incompressible Boussinesq fluid. The
first term on the RHS of (1) is called the eddy flux divergence. Its net effect is to redistribute
the tracer within the body of the fluid. The second term, $F$, represents the averaged fluxes
of tracer from the surface of the ocean (atmospheric forcing) and through the base of the ML
(entrainment of thermocline waters). The challenge of this section is to derive a long-term,
large-scale equation for the mean tracer, by expressing the eddy fluxes in (1) in terms of
mean quantities.

Eddy  fluxes  of  a  conserved  tracer  can  be  parameterized  with  closures  based  on  local  mean 
gradients.  The  argument  goes  that  a  fluid  particle  carries  the  value  of  a  conserved,  and  hence 
transferable,  tracer  for  some  length  1',  before  it  is  mixed  with  its  new  surroundings.  If  the 
particle  was  initially  typical  of  its  surroundings  then  the  eddy  flux  of  tracer  8  is  given  by 

5^  = V0,  (2) 

where  it  is  assumed  that  V#  varies  little  over  distances  comparable  with  the  mixing  length 
1'.  The  tensor  u'l'  defines  the  eddy  diffusivity.  The  most  commonly  used  parameterization 
of  horizontal  eddy  transports  in  the  ML  is  to  assume  a  down-gradient  Fickian  diffusion, 

u7#7  =  -kV8,  (3) 

where $k$ is set to an a priori value, constant both in space and time. However, down-gradient
diffusion follows from (2) only if the statistics of the eddy field are horizontally homogeneous
and isotropic. In the ML, lateral eddy transports on time scales longer than the vertical mixing
time result from hydrodynamical instabilities that release the gravitational potential energy
stored in horizontal density gradients. The dynamics is simple: lateral density gradients
slump under the action of gravity and drive horizontal eddy fluxes. The eddy velocity $\mathbf{u}'$ and
the eddy displacement $\mathbf{l}'$ associated with the release of potential energy must be, on average,
in the direction of the density gradients and larger when the horizontal density gradients are
larger. Therefore in the ML the statistics of the eddy field are not homogeneous and isotropic
and a down-gradient Fickian closure is not appropriate. It seems natural to look instead for
closures that include the effect of large scale density gradients. A general expression for the
eddy diffusivity tensor can be written in the form

$$\overline{\mathbf{u}'\mathbf{l}'} = \gamma f(|\nabla\rho|)\,\nabla\rho\nabla\rho, \tag{4}$$

where $\rho$ is the density of the fluid, $\gamma$ a constant and $f(|\nabla\rho|)$ a function whose form
depends on the details of the hydrodynamic instabilities that dominate in the eddy field. The
corresponding flux of tracer is

$$\overline{\mathbf{u}'\theta'} = -\gamma f(|\nabla\rho|)\,(\nabla\rho\cdot\nabla\bar\theta)\,\nabla\rho. \tag{5}$$

Notice that, even though the flux is in the direction of the mean density gradient, $\overline{\mathbf{u}'\theta'}\cdot\nabla\bar\theta < 0$;
thus the flux of tracer tends to be down the tracer gradient. With (5) we can write down an
equation that describes the dispersion of the mean tracer,

$$\theta_t + \mathbf{u}\cdot\nabla\theta = \gamma\nabla\cdot\left[f(|\nabla\rho|)\,(\nabla\rho\cdot\nabla\theta)\,\nabla\rho\right] + F, \tag{6}$$

where  we  dropped  the  overbars.  Hereinafter  we  assume  that  all  variables  are  averaged  over 
the  depth  of  the  ML  and  over  times  longer  than  the  vertical  mixing  time. 
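
The flux law (5) can be illustrated numerically. The following is a minimal sketch, not part of the original analysis: the function name `eddy_flux`, the sample gradient values, and the default choices $\gamma = 1$, $f = 1$ are all illustrative assumptions. It checks that the flux is parallel to the density gradient and yet has a non-positive projection on the tracer gradient.

```python
import numpy as np

def eddy_flux(grad_rho, grad_theta, gamma=1.0, f=lambda g: 1.0):
    """Tracer flux -gamma * f(|grad rho|) * (grad rho . grad theta) * grad rho, as in (5)."""
    grad_rho = np.asarray(grad_rho, dtype=float)
    grad_theta = np.asarray(grad_theta, dtype=float)
    magnitude = np.linalg.norm(grad_rho)
    return -gamma * f(magnitude) * np.dot(grad_rho, grad_theta) * grad_rho

# Hypothetical sample gradients (arbitrary units).
grad_rho = np.array([1.0, 0.5])
grad_theta = np.array([0.2, -1.0])

flux = eddy_flux(grad_rho, grad_theta)
# The flux is parallel to grad_rho, but its projection on grad_theta is non-positive.
down_gradient = np.dot(flux, grad_theta)
parallel_residual = flux[0] * grad_rho[1] - flux[1] * grad_rho[0]
```

Passing a different `f`, for example `lambda g: 1.0 / g` or `lambda g: g`, switches between the closures reviewed in this section without changing the sign of the projection.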

We can now turn to the case where the tracers are temperature $T$ and salinity $S$. Let us
express density as $\rho = \rho_0[1 - g^{-1}B]$, where $B$ is buoyancy. Assuming that the equation of
state is linear, and using suitable definitions, the buoyancy is

$$B = T - S. \tag{7}$$

With  our  definitions,  S,  T  and  B  all  have  the  dimensions  of  acceleration.  The  nonlinear 
diffusion  equations  that  describe  the  buoyancy  driven  dispersion  of  heat  and  salt  in  the  ML 
follow  from  (6), 


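$$T_t + \mathbf{u}\cdot\nabla T = \gamma\nabla\cdot\left[f(|\nabla B|)\,(\nabla B\cdot\nabla T)\,\nabla B\right] + F_T, \tag{8}$$

$$S_t + \mathbf{u}\cdot\nabla S = \gamma\nabla\cdot\left[f(|\nabla B|)\,(\nabla B\cdot\nabla S)\,\nabla B\right] + F_S, \tag{9}$$

where $F_T$ and $F_S$ represent the forcings on temperature and salinity.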
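
As a short worked step (a sketch that follows directly from the definitions above, not an equation quoted from the original), subtracting (9) from (8) and using $B = T - S$ gives a closed equation for buoyancy, in which the nonlinearity acts on the buoyancy gradient alone:

```latex
B_t + \mathbf{u}\cdot\nabla B
  = \gamma\,\nabla\cdot\left[ f(|\nabla B|)\,|\nabla B|^{2}\,\nabla B \right] + F_T - F_S .
```

Buoyancy therefore diffuses with an effective diffusivity $\gamma f(|\nabla B|)|\nabla B|^2$, while the compensated combination $T + S$ is transported only where buoyancy gradients are present.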

Parameterizations in which the dynamics depends exclusively on horizontal density gradients
are not new to the oceanographic literature. The idea descends from Stommel's two-box
idealization of the thermohaline circulation [6]. Stommel posited a transport law in which
the exchange of mass between boxes is proportional to the square of the density difference.
Some thirty years later Stommel and Young used the same model to study the $T$-$S$ relation
in the ML [7]. Stommel's model is essentially a two-grid-point discretization of (8) and (9)
with $f(|\nabla B|) = |\nabla B|^{-1}$. Other classes of nonlinear diffusion models have been developed
for the full range of space and time scales that are of interest to oceanographers: from the
planetary scale of the thermohaline circulation, through the deformation scale dynamics of
baroclinic eddies, down to the ageostrophic circulations in shallow water systems. Here we
limit our attention to models for the upper ocean. The redistribution of temperature and
salinity in the ML is the result of processes that release the potential energy stored in
horizontal stratification, viz. buoyancy driven shear dispersion at scales below the deformation
radius and baroclinic instability at larger scales. Our goal is now to emphasize the unity of
the physical ideas underlying the two processes and to show that both can be described by
the equations in (8) and (9).

2.1  Buoyancy  driven  shear  dispersion 

W. R. Young, in the first part of this volume, discussed a class of models to parameterize
the transport of heat and salt in a ML idealized as a shallow system with strong vertical
mixing. He derived a set of nonlinear diffusion equations for $T$ and $S$, where the nonlinearity
arises because the horizontal transport of heat and salt is by shear dispersion, and the shear
flow doing the dispersing is driven by slumping horizontal buoyancy gradients. The main
point of his presentation was that the lateral diffusivity in such a ML is proportional to the
horizontal buoyancy gradient squared. Young's model is a particular case of (8) and (9), with
$f(|\nabla B|) = 1$ and no vertically averaged flow, $\mathbf{u} = (0,0)$,

$$T_t = \gamma\nabla\cdot\left[(\nabla B\cdot\nabla T)\,\nabla B\right] + F_T, \tag{10}$$

$$S_t = \gamma\nabla\cdot\left[(\nabla B\cdot\nabla S)\,\nabla B\right] + F_S. \tag{11}$$

The constant $\gamma$ depends on the depth $H$ of the ML and on the details of the processes doing
the vertical mixing. If mixing is parameterized as an intermittent process that homogenizes
the ML at intervals of time $\tau$, one finds that $\gamma = H^2\tau^3/96$ [8]. Different parameterizations
agree on the functional dependence of $\gamma$ on the depth $H$ and the characteristic time of vertical
mixing $\tau$.

It is important to discuss the range of oceanic parameters for which the nonlinear diffusion
equations (10) and (11) might apply. Shear dispersion mechanisms can act only at horizontal
scales larger than $H$, say 100 m, and shorter than the deformation radius, typically 10 km
in the ML, and at time scales longer than the vertical mixing time, say a day. Motions with
smaller lengths and higher frequencies are parameterized as "vertical mixing" (e.g. Langmuir
circulations and convective overturning). At larger scales the effects of rotation become
important and the dynamics change substantially, as shown in the next section. For $H = 100$
m and $\tau = 1$ day one obtains $\gamma \approx 10^{17}$ m$^2$ s$^3$; we use this as a reference value throughout the
report whenever we make quantitative statements about our results.
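
The reference value can be checked with a few lines of arithmetic. This is only a sketch of the estimate $\gamma = H^2\tau^3/96$ quoted above; the variable names are illustrative.

```python
H = 100.0          # mixed-layer depth [m]
tau = 86400.0      # vertical mixing time [s], one day

# Intermittent-mixing estimate gamma = H^2 tau^3 / 96, in m^2 s^3.
gamma = H**2 * tau**3 / 96.0
print(f"gamma = {gamma:.2e} m^2 s^3")   # about 6.7e16, i.e. of order 1e17
```

Because $\gamma$ scales as $\tau^3$, shortening the mixing time to a third of a day reduces $\gamma$ by a factor of 27.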

2.2  Buoyancy  fluxes  produced  by  baroclinic  instability:  Green,  Stone  and 
Held 

At scales larger than the Rossby radius of deformation, eddies produced by baroclinically
unstable buoyancy gradients can dominate the transports of heat and salt in the ML. Green used
the arguments reviewed at the beginning of this section to show that the transfer properties
of baroclinic eddies can be parameterized in terms of their large scale structure [9]. Supposing
that the transfer of buoyancy occurs in the growing phase of baroclinic eddies, when energy
is extracted by the eddy from the baroclinic zone of the large scale flow, Green deduced the
expected form of the eddy diffusivity in the limit of large Richardson number. He then used
linear stability analysis of the baroclinic wave to determine the direction of the eddy fluxes.
Stone derived a similar expression from linear stability analysis and extended Green's results
to small Richardson numbers [10]. Green and Stone imagined that diabatic processes, like
vertical mixing, restore the baroclinic zone, replenishing the supply of available potential
energy until it is discharged again by baroclinic instability. This repeated conversion of energy
leads to a diapycnal eddy flux. Green and Stone attempted to relate this flux to large-scale
parameters in zonally averaged models of the atmosphere,

$$\overline{v'\theta'} = -\frac{\alpha}{N}\,l^2\,|B_y|\,\bar\theta_y, \tag{12}$$

where $v'$ is the meridional eddy velocity, $N$ is the Brunt-Väisälä frequency, $\alpha$ a universal
constant of proportionality, and $l$ a measure of the meridional eddy displacement. Green
argues that this distance is set by the width of the baroclinic zone. Stone, however, suggests
that the deformation radius is the appropriate length scale. The ideas of Green and Stone
nevertheless converge on predicting that the baroclinic eddy fluxes across a buoyancy gradient
are driven by the absolute value of the diapycnal buoyancy gradient. It then makes good
physical sense to extend (12) to two dimensions as

$$\overline{\mathbf{u}'\theta'} = -\frac{\alpha l^2}{N\,|\nabla B|}\,(\nabla\bar\theta\cdot\nabla B)\,\nabla B. \tag{13}$$

With this parameterization of lateral eddy transports, the dispersion of temperature and
salinity in the ML is once again described by nonlinear diffusion equations of the form in (8)
and (9): in this case $f(|\nabla B|) = |\nabla B|^{-1}$ and $\gamma = \alpha l^2 N^{-1}$.

Pavan and Held used the results of numerical simulations to test diffusive parameterizations
of the buoyancy fluxes produced by baroclinic instability [11]. They integrated a two
layer model of a baroclinically unstable jet to obtain a series of statistically steady states
for different jet widths and evaluated diffusive approximations of the eddy buoyancy fluxes.
In their simulations the flow is forced by relaxation to an unstable prescribed buoyancy
gradient. Pavan and Held found that a flux-gradient relationship, deduced by a simple fit to
numerical results, predicts the magnitude and the shape of the eddy fluxes in the unstable
jet flows remarkably well. The functional form of the relationship differs from that suggested
by Green and Stone; for large buoyancy gradients, that is for temperature gradients larger
than the climatological mean of 6 °C over 1000 km, the effective diffusivity is proportional to
the fourth power of the buoyancy gradient. In the presence of two stratifying components,
like heat and salt, one obtains a set of coupled equations for temperature and salinity of the
form in (8) and (9) with $f(|\nabla B|) = |\nabla B|$.

The main conclusion of this brief review is that different diffusive parameterizations of
tracer fluxes produced by baroclinic instability agree on one point: diapycnal baroclinic fluxes
can be described with nonlinear diffusive closures where the diffusivity is proportional to some
power of the buoyancy gradient. Differences emerge only in the power law that relates the
buoyancy flux to its gradient, but all parameterizations are captured by the general form in
(8) and (9). The parameterizations discussed so far describe the eddy transports across and
not along isopycnal surfaces. A full parameterization of baroclinic instability should account
for the epipycnal fluxes as well. However, our goal is to explain how a density ratio of one is
maintained in the ML at small scales, even though the ratio of heat and freshwater fluxes is
extremely variable in the external forcing. Isopycnal eddy fluxes do not release the energy
stored in horizontal buoyancy gradients and thus cannot explain compensation. Therefore we
do not attempt a parameterization of epipycnal fluxes in this project, but the role of these
fluxes in creating correlations between two-dimensional distributions of temperature and
salinity remains an open question.

3  Temperature-salinity  correlations  at  small  scales  as  a  result 
of  nonlinear  diffusive  parameterizations 

We  begin  by  exploring  the  implications  of  the  nonlinear  diffusive  parameterization  of  the 
ML  in  an  idealized  setting.  Suppose  that  temperature  and  salinity  variations  are  created  at 
some  instant  by  atmospheric  forcing  or  entrainment  of  thermocline  waters  and  that  these 
horizontal  nonuniformities  disappear  as  a  result  of  nonlinear  diffusion.  This  is  a  rundown 
problem  in  which  there  is  no  external  forcing  and  the  down-gradient  diffusion  eventually 
erases  all  the  initial  fluctuations  in  temperature  and  salinity.  There  is  no  mechanism  to 
produce  new  randomness:  all  the  randomness  comes  from  the  initial  condition.  Equations 
(8)  and  (9)  reduce  to 

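$$T_t = \gamma\nabla\cdot\left[f(|\nabla B|)\,(\nabla B\cdot\nabla T)\,\nabla B\right], \qquad S_t = \gamma\nabla\cdot\left[f(|\nabla B|)\,(\nabla B\cdot\nabla S)\,\nabla B\right]. \tag{14}$$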

Observe  that  the  free  decay  of  temperature  and  salinity  is  perfectly  symmetric  in  this  system. 
If  the  statistics  of  temperature  and  salinity  are  identical  in  the  initial  conditions,  then  no 
asymmetries  in  the  thermohaline  fields  can  develop  at  later  times.  One  might  then  expect 
that  this  rundown  problem  shows  only  the  progressive  decay  of  the  initial  temperature  and 
salinity fluctuations. However, apart from the trivial decay, well before all fluctuations are
erased,  nonlinear  diffusion  creates  nontrivial  correlations  between  the  thermohaline  fields  and 
between  their  gradients.  The  physical  reason  is  that  in  the  initial  conditions  there  will  be 
regions  in  which  the  temperature  and  salinity  gradients  will  happen  to  partially  compensate 
in  their  joint  effect  on  buoyancy.  In  those  regions  the  nonlinear  diffusion  will  be  small  and  the 
initially  compensating  gradients  will  persist.  Likewise,  the  initial  conditions  will  also  contain 
regions  in  which  the  thermohaline  gradients  accidentally  produce  large  buoyancy  changes; 
those  regions  are  then  subject  to  strong  diffusion  and  the  gradients  will  quickly  disappear. 
The  consequence  of  this  selective  decay  is  that  only  compensated  thermohaline  gradients 
persist  as  the  system  runs  down  and  a  density  ratio  of  1  is  established. 

3.1  One-dimensional  model 

We start by considering the simplest problem in which the temperature and salinity have
spatial variations in only the $y$-direction. Equations (14) then reduce to

$$T_t = \gamma\left(|B_y|^n T_y\right)_y, \qquad S_t = \gamma\left(|B_y|^n S_y\right)_y, \tag{15}$$

where $n$ is an integer that can take any value from 1 to 3 according to the nonlinear closure
under consideration.

The one-dimensional case is sufficient to illustrate the development of thermohaline
correlations from random initial conditions. Numerical simulations of the system in (15) are
carried out in nondimensional variables. Buoyancy, temperature and salinity are measured in
terms of the initial RMS buoyancy variations ($B_0$), lengths in terms of the initial correlation
length ($l_0$), and time in terms of $\tau_\gamma = l_0^{n+2}/(\gamma B_0^n)$; $\tau_\gamma$ is the time it takes to mix tracers over
a distance $l_0$ using nonlinear diffusion driven by the initial RMS buoyancy gradients. In
terms of nondimensional variables one simply sets $\gamma = 1$ in (15). The initial conditions are
established by selecting $T$ and $S$ uncorrelated at each grid point using a uniform probability
density function with zero mean such that

$$\langle B^2\rangle = 1, \qquad \langle T^2\rangle = \langle S^2\rangle = \tfrac{1}{2}, \tag{16}$$


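where $\langle\,\cdot\,\rangle$ is an integral over the domain, normalized by the domain length $L$,

$$\langle\theta\rangle = \frac{1}{L}\int_0^L \theta\,dy. \tag{17}$$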

The numerical calculations are performed by integrating the nondimensional coupled
equations for temperature and salinity. Global conservation of heat and salt is satisfied by
requiring that the gradients $T_y$ and $S_y$ vanish at both ends of the domain (i.e., "no-flux"
boundary conditions). We solve the system in (15) on a discrete spatial grid with an explicit
Euler forward scheme in time and central differencing in space. The time step, $\Delta t$, is short
enough to accurately solve the set of ordinary differential equations obtained by the spatial
discretization of (15).
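
The scheme described above can be sketched in a few lines. What follows is an illustrative reimplementation under stated assumptions, not the code used for the figures: the function name `rundown_1d`, the grid spacing equal to one correlation length, the adaptive time step with safety factor 0.2, and the random seed are all choices made here for the sake of a runnable example.

```python
import numpy as np

def rundown_1d(n=2, npts=1000, t_end=3.0, seed=0):
    """Forward-Euler run of T_t = (|B_y|^n T_y)_y, S_t = (|B_y|^n S_y)_y, with B = T - S."""
    rng = np.random.default_rng(seed)
    a = np.sqrt(1.5)                     # uniform on [-a, a] has variance a**2 / 3 = 1/2
    T0 = rng.uniform(-a, a, npts)
    S0 = rng.uniform(-a, a, npts)
    T, S = T0.copy(), S0.copy()
    dy = 1.0                             # grid spacing taken as the correlation length (assumption)
    t = 0.0
    while t < t_end - 1e-12:
        Ty, Sy = np.diff(T) / dy, np.diff(S) / dy     # central differences at cell faces
        D = np.abs(Ty - Sy) ** n                      # nonlinear diffusivity |B_y|^n
        # Explicit stability limit; buoyancy feels an effective diffusivity (n + 1) * D.
        dt = min(0.2 * dy**2 / ((n + 1) * D.max() + 1e-12), t_end - t)
        # Zero flux through both ends of the domain conserves heat and salt.
        flux_T = np.concatenate(([0.0], D * Ty, [0.0]))
        flux_S = np.concatenate(([0.0], D * Sy, [0.0]))
        T = T + dt * np.diff(flux_T) / dy
        S = S + dt * np.diff(flux_S) / dy
        t += dt
    return T0, S0, T, S

T0, S0, T, S = rundown_1d()
corr_start = np.corrcoef(np.diff(T0), np.diff(S0))[0, 1]
corr_end = np.corrcoef(np.diff(T), np.diff(S))[0, 1]
print(f"gradient correlation: {corr_start:.2f} at t=0, {corr_end:.2f} at t=3")
```

Writing the update in flux form makes the discrete totals of $T$ and $S$ conserved to rounding error, which mirrors the no-flux boundary conditions; the correlation between $T_y$ and $S_y$ is one simple way to quantify the collapse onto the compensation line.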

The creation of positive correlations between temperature and salinity and their gradients
is shown in a series of $T$-$S$ and $T_y$-$S_y$ scatterplots. Figures 1, 2 and 3 show the results
of simulations with different nonlinear power diffusivities, respectively $n = 1$, $n = 2$ and
$n = 3$ in (15). In all three cases the thermohaline compensation is evident as an extension
of the cloud of points along the "compensation line" $T_y = S_y$ at time $t = 3$. A cloud of
points collapsed along the "compensation line" is equivalent to a density ratio of 1; thus
diffusive parameterizations with a diffusivity that depends on the horizontal buoyancy gradient
correctly predict the observed density ratio of 1 at small scales. The reader interested in a
more rigorous analysis of these simulations is referred to [8], where the nonlinear diffusion
equation with diffusivity $n = 2$ is analyzed both numerically and analytically. Here we want
to stress the fact that compensation is typical of all parameterizations that have a diffusivity
that grows with the horizontal gradients of buoyancy. Changing the power
law in the diffusive parameterization affects the aspect ratio of the cloud of points in the
scatterplots, but not the orientation of the major axis.

Finally, let us discuss the characteristic time over which compensation occurs. After
the passage of a hurricane, typical temperature gradients can be as large as 0.1 °C over a
kilometer. The shear dispersion model for a mixed layer 100 m deep, with a vertical mixing
time of the order of one day, shows compensation over scales of a kilometer after only one
hour. Choosing a vertical mixing time scale of a third of a day gives compensation in
one day. More importantly, compensation on scales of 10 km takes 100 times longer to be
established. Therefore we expect compensation to be restored after each forcing
event on scales of a kilometer, while on larger scales compensation and external forcing act
on comparable time scales. This explains why a density ratio of 1 is observed only up to
scales of at most 10 kilometers.

3.2  Two-dimensional  model 

In  the  previous  section  we  investigated  nonlinear  diffusive  parameterizations  in  one  dimension. 
We  now  extend  our  analysis  to  two  dimensional  models.  A  second  spatial  dimension  adds 
new  flavor  to  the  problem,  because  the  nonlinear  diffusion  depends  not  only  on  the  magnitude 
but  also  on  the  relative  orientation  of  the  thermohaline  gradients. 

Numerical simulations of the nonlinear diffusion equation in (14) are carried out in
dimensionless variables as discussed in the previous section. Conservation of heat and salt is satisfied
by imposing periodic boundary conditions at the edges of a square domain. The initial
conditions have random and uncorrelated $T$ and $S$ profiles. The numerical integrator uses a
third order Adams-Bashforth scheme in time and central differences in space. The time step
is chosen short enough to ensure stability and accuracy of the solutions.

The two-dimensional rundown problem shows that correlations develop between the
magnitudes of the thermohaline fields in much the same way as described for the one-dimensional
case. Scatterplots of the thermohaline gradients, computed along an arbitrary path within
the domain, collapse along the compensation line during the first phases of the simulation
and look exactly like those in figures 1, 2 and 3. The new result is that correlations develop
also between the orientations of the thermohaline gradients. Figure 4 shows histograms of
the angles between the temperature and salinity gradients at each grid point. The process
of "thermohaline alignment" is extremely fast and by time $t = 1$ (of the order of one day
or less in dimensional units for typical ML gradients) $T$ and $S$ gradients are nearly
everywhere parallel. Note that the number of aligned gradients increases with time, showing that
nonlinear diffusion actively tilts the $T$ and $S$ isolines so as to create a uniform buoyancy
field. In one dimension compensation is the result of dissipation of all buoyancy gradients. In
two dimensions nonlinear diffusion can also create compensated gradients not present in the
initial conditions, by selectively dissipating those components of thermohaline gradients that
project on density gradients. The results shown in figure 4 are obtained for a shear dispersion
closure, that is by substituting $f(|\nabla B|) = 1$ in (14). Analogous results are obtained for the
other forms of $f(|\nabla B|)$ introduced in section 2.
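
The alignment diagnostic behind the histograms can be sketched as follows. This is a minimal illustration under assumptions made here (the helper name `gradient_angles`, centred differences on a doubly periodic grid, 18 histogram bins, and random test fields); it is not the code used to produce figure 4.

```python
import numpy as np

def gradient_angles(T, S, dx=1.0):
    """Angle in [0, pi] between grad T and grad S at every point of a periodic grid."""
    def grad(field):
        gx = (np.roll(field, -1, axis=1) - np.roll(field, 1, axis=1)) / (2 * dx)
        gy = (np.roll(field, -1, axis=0) - np.roll(field, 1, axis=0)) / (2 * dx)
        return gx, gy
    Tx, Ty = grad(T)
    Sx, Sy = grad(S)
    dot = Tx * Sx + Ty * Sy
    cross = Tx * Sy - Ty * Sx
    # arctan2 of |cross| and dot avoids dividing by vanishing gradient magnitudes.
    return np.arctan2(np.abs(cross), dot)

rng = np.random.default_rng(1)
T = rng.uniform(-1.0, 1.0, (100, 100))
S = rng.uniform(-1.0, 1.0, (100, 100))

counts_random, edges = np.histogram(gradient_angles(T, S), bins=18, range=(0.0, np.pi))
aligned = gradient_angles(T, 2.0 * T)    # parallel gradients: every angle is zero
opposed = gradient_angles(T, -T)         # antiparallel gradients: every angle is pi
```

With $B = T - S$, compensated gradients are parallel, so thermohaline alignment appears as a histogram that piles up near zero angle.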

The main result of the two-dimensional simulations is that nonlinear diffusive
parameterizations of the ML produce compensation between the gradients of temperature and salinity
along any direction. This is consistent with the density ratio of 1 measured by profiling the
ML along one-dimensional paths [3]. It remains to understand why compensation and a
density ratio of 1 are observed only at small scales and not at scales of thousands of kilometers.
We address this point in the next section.

Figure 1: This figure shows the results of a simulation in which 1000 points in the $(S, T)$ and
$(S_y, T_y)$ planes are created by picking uncorrelated temperature and salinity from a uniform
probability density with variance 1/2; thus the variance of $B = T - S$ is one. The nonlinear
diffusivity is of the form $|B_y| = |T_y - S_y|$. The upper panels show scatterplots at time $t = 0$.
The lower panels show scatterplots at $t = 3$.

Figure 2: As in figure 1, but with a nonlinear diffusivity of the form $|B_y|^2$.

Figure 4: Histograms of the angles between the temperature and the salinity gradients at
four different times in a simulation on a grid of 100 x 100 points. The initial conditions are
created by picking uncorrelated temperature and salinity from a uniform probability density
with variance 1/2; thus the variance of $B = T - S$ is one. The thermohaline fields evolve
according to the equations in (14) with $\gamma = 1$ and $f(|\nabla B|) = 1$.

4  Temperature-salinity relation at large scales explained with
a forced nonlinear diffusion equation

We showed how correlations develop between the thermohaline fields as a result of
nonlinear diffusive parameterizations by solving initial value problems in which temperature and
salinity are prepared as random and uncorrelated distributions. For typical ML
thermohaline gradients these correlations are established on time scales of days for length scales of a
kilometer and much more slowly for longer length scales. One then expects that horizontal density
fluctuations on scales of a few kilometers disappear soon after they are created by forcing
events like storms or wind bursts. This is consistent with the experimental evidence that
typical thermohaline gradients on scales shorter than 10 km are compensated [3]. At scales
of tens of kilometers nonlinear diffusion is not as efficient and it does not restore
compensation between one forcing event and the next. The long term climatological $T$-$S$ relationship
and the large scale density ratio of the ML are then the result of the competition between
nonlinear diffusion, which dissipates horizontal density gradients, and thermohaline forcing,
which continuously creates thermohaline anomalies.

The  combined  action  of  surface  forcing  and  nonlinear  diffusion  on  large  scales  in  the  ML 
is  investigated  with  a  model  of  the  form  given  in  (8)  and  (9).  We  restrict  temperature  and 
salinity  to  have  spatial  variations  only  in  the  meridional  y-direction.  A  one  dimensional  model 
enables  us  to  perform  extensive  Monte  Carlo  simulations  and  to  obtain  stable  statistics  of  the 
thermohaline fields under the action of forcing. At the same time the one-dimensional case
captures the essential dynamics of thermohaline compensation; we verified that introducing a
second  dimension  in  simulations  of  the  rundown  problem  does  not  change  the  dynamics  that 
lead  to  the  decay  of  density  gradients.  Only  the  process  of  thermohaline  alignment  cannot 
be  captured  in  one  dimension,  but  a  characterization  of  the  relative  orientation  of  T  and  S 
gradients  goes  beyond  the  goal  of  this  study.  Here  we  limit  our  analysis  to  one  dimensional 
statistics  like  the  density  ratio. 

Numerical simulations of the nonlinear diffusion equations with forcing must be run for
long times to achieve stable statistics. In order to save on CPU time, we limited our analysis
of the equations in (8) and (9) to only one form of nonlinear diffusion. We chose the quadratic
nonlinear diffusion that follows from a shear dispersion closure and set $f(|\nabla B|) = 1$. Only a
few results depend on this particular choice and we will point them out. In one dimension
the equations for $T$ and $S$ reduce to

$$T_t = \gamma\left(B_y^2 T_y\right)_y - \alpha\left(T - \theta(y)\right) + G(y,t), \tag{18}$$

$$S_t = \gamma\left(B_y^2 S_y\right)_y + F(y,t). \tag{19}$$

In the temperature equation (18), $\theta$ is the long time averaged atmospheric temperature
determined by climatological data and $\alpha(T - \theta)$ represents the atmospheric feedback on the
temperature of the ML: the larger $T - \theta$, the larger the heat flux from the ocean ML into
the atmospheric boundary layer. $G(y,t)$ represents all other sources of heat such as daily
and seasonal temperature variations. In the following numerical calculations $G(y,t)$ is set
to zero under the assumption that the long term surface temperature is dominated by the
climatological forcing rather than by the short term temperature fluctuations. The parameter
$\alpha$ sets the time scale needed for the ocean ML to adjust to the atmospheric temperature $\theta$.
When this time scale is small compared to the characteristic time scale of nonlinear diffusion,
the equation for temperature reduces to

$$T = \theta(y). \tag{20}$$


The parameter $\alpha$ is determined by the time averaged heat flux from the atmosphere to the
ocean and for a 100 m deep ML it is taken to be $10^{-7}$ s$^{-1}$ [12]. For climatological gradients
(6 °C over 1000 km) the nonlinear diffusion term is three orders of magnitude smaller than
the relaxation term and (20) is a very good approximation of the equilibrated temperature
field. Numerical simulations will show that this is indeed the case.

In the salinity equation (19), $F(y,t)$ represents the combined action of salinity fluxes
across the air-sea interface (evaporation and precipitation) and across the base of the ML
(entrainment of thermocline waters). The forcing on salinity $F(y,t)$ is modeled in the form
of localized sources and sinks randomly distributed in space and time. Each forcing event
creates a salinity anomaly of the order of 0.01 psu in one day, equivalent to 3 cm of freshwater
dumped in a 100 m deep ML or equivalently to an entrainment of 2 m of water with a salinity
0.5 psu larger or smaller. We adjust the size and frequency of the forcing events so that we
obtain the equivalent of a precipitation of 1 m per year, a value typical at midlatitudes in the
North Pacific and in the North Atlantic [13], [14]. In order to conserve salinity the integral
of the forcing on salinity must have zero average over long times,

$$\lim_{T\to\infty}\int_0^T dt \int_0^L dy\, F(y,t) = 0. \tag{21}$$


We impose this constraint by prescribing $F(y,t)$ as the sum of five positive and five negative
Gaussians, all of the same amplitude, so that their integral over the domain vanishes at all times. The
positions of the Gaussians within the $L = 1000$ km of the domain are sampled from a uniform
probability distribution and are changed every day. This ensures that the long time average
forcing is zero at each point, even though several Gaussians can hit in the same place over a
short period of time and produce strong local salinity and buoyancy gradients.
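
A forcing of this kind can be sketched as below. The amplitude, the Gaussian width of 10 km, the function name `salinity_forcing`, and the final subtraction of the domain mean (which removes the small residual left by Gaussians clipped at the ends of the domain) are all assumptions made for this illustration; the original text does not specify them.

```python
import numpy as np

def salinity_forcing(y, rng, amplitude=1.0, width=10.0e3, n_pairs=5):
    """Sum of n_pairs positive and n_pairs negative Gaussians at random positions."""
    centres = rng.uniform(y[0], y[-1], 2 * n_pairs)
    signs = np.repeat([1.0, -1.0], n_pairs)          # five sources, five sinks
    bumps = np.exp(-0.5 * ((y[:, None] - centres[None, :]) / width) ** 2)
    F = amplitude * (bumps * signs).sum(axis=1)
    return F - F.mean()    # assumption: remove the residual from bumps clipped at the walls

L = 1000.0e3
y = np.linspace(0.0, L, 1001)                        # 1 km grid spacing
rng = np.random.default_rng(0)
# New random positions are drawn for every day of forcing.
daily_forcing = [salinity_forcing(y, rng) for _ in range(365)]
```

Each daily realization has zero domain mean, so the total salt content is unaffected, while chance overlaps of same-signed Gaussians can still build up strong local anomalies.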

The  key  element  of  the  problem  set  in  (18)  and  (19)  is  that  there  is  an  asymmetry  in  the 
large  scale  forcing  of  temperature  and  salinity.  Temperature  is  relaxed  to  some  atmospheric 
climatological  mean,  while  salinity  is  forced  by  random  rainfall,  evaporation  and  entrainment. 
The idea that asymmetries in the thermohaline forcing together with nonlinear
parameterizations of eddy fluxes produce the observed large scale mean density ratio was raised
by Stommel [4]. Stommel tested this hypothesis with an idealized two-box model of the ML,
but  did  not  attempt  any  comparison  with  observations.  Here  we  explore  the  process  in  the 
context  of  a  continuous  model  of  the  ML  and  we  quantify  its  effect  for  realistic  oceanographic 
parameters. 


4.1  Numerical  algorithm 

For the simulations we use dimensionless variables. Temperature and salinity gradients are
measured in terms of the climatological temperature gradient (6 °C over 1000 km), lengths
in terms of the domain length (1000 km), and time in terms of the time it takes to mix
tracers across the domain (3000 years).
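
The order of magnitude of this mixing time can be recovered from the reference value of $\gamma$ given in section 2.1. The sketch below assumes a thermal expansion coefficient of $1.7\times10^{-4}$ K$^{-1}$ to convert the temperature gradient into a buoyancy gradient; that coefficient and the variable names are assumptions of this example, not values stated in the text.

```python
g = 9.81                 # gravitational acceleration [m s^-2]
alpha_T = 1.7e-4         # thermal expansion coefficient [1/K] (assumed value)
gamma = 1.0e17           # reference value from section 2.1 [m^2 s^3]
L = 1000.0e3             # domain length [m]
dT_dy = 6.0 / L          # climatological temperature gradient [K/m]

B_y = g * alpha_T * dT_dy                # buoyancy gradient [s^-2]
kappa = gamma * B_y**2                   # effective shear-dispersion diffusivity [m^2/s]
t_mix_years = L**2 / kappa / (365.25 * 86400.0)
print(f"kappa = {kappa:.1f} m^2/s, mixing time = {t_mix_years:.0f} years")
```

Under these assumptions the effective diffusivity is of order 10 m$^2$ s$^{-1}$ and the mixing time across the domain is a few thousand years, in line with the value quoted above.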

The numerical calculations are performed by integrating the nondimensional equations
for the temperature and salinity gradients, obtained by differentiating (18) and (19) in $y$,

$$(T_y)_t = \left(B_y^2 T_y\right)_{yy} - \alpha\left(T_y - \theta_y\right) - \mu\,(T_y)_{yyyy}, \tag{22}$$

$$(S_y)_t = \left(B_y^2 S_y\right)_{yy} + F_y - \mu\,(S_y)_{yyyy}. \tag{23}$$

The hyperviscosity term, with coefficient $\mu$, is introduced to dissipate small scale gradients and
to allow longer time stepping in the simulations. In this section we are interested in the effect
of nonlinear diffusion on large scales and we do not need to resolve small scale structures. We
use a grid spacing of 1 km and choose a value for $\mu$ that filters out all fluctuations with length
scales shorter than 3 km. Global conservation of heat and salt is satisfied by requiring that
$T_y = T_{yyy} = 0$ and $S_y = S_{yyy} = 0$ at both ends of the domain (i.e., no-flux boundary
conditions). We solve the system in (22) and (23) on a discrete spatial grid with an explicit
Euler forward scheme in time and a dealiased sine spectral code in space. The time step is set
by the decorrelation time of the stochastic forcing acting on salinity, chosen to be one day.

4.2  The  effect  of  stochastic  forcing  on  salinity 

Our picture of the large scale T-S relation in the ML is that the enormously variable
forcing creates salinity anomalies that are eliminated by nonlinear diffusion. The combination
of strong forcing on salinity and nonlinear eddy fluxes holds the average density ratio to
a constant value. It is instructive to study the decay of a localized salinity anomaly to
understand how the T-S relationship is established.

Let us study the evolution of T and S after the forcing has created a large scale salinity
anomaly. We prescribe a linear climatological gradient ($\theta_y = 1$) and we assume that the
relaxation is so fast (large $\alpha$ limit) that temperature is always in equilibrium
($T_y = 1$). In this limit the nonlinear diffusion equation for salinity reduces to

$$S_t = [(S_y - 1)^2 S_y]_y - \mu S_{yyyy}. \qquad (24)$$

We take the initial salinity profile to have the same width and shape as the Gaussian salinity
anomalies created by F in (19). Figure 5 shows the profiles of $S$ and $S_y$ at time zero and
at some later time. Salinity is diffused faster where the initial profile has a negative slope
than where it has a positive slope. This asymmetry is easily explained if we consider that,
for a gradient of given magnitude $|S_y|$, the nonlinear diffusivity $(S_y - 1)^2$ is larger where
$S_y$ is negative than where $S_y$ is positive. A look at the profiles of salinity gradients confirms
that $S_y$ diffuses at a slower rate when it has a value close to 1. As expected, in the absence of
forcing, salinity decays to the only stable solution $S_y = 0$ (the solution $S_y = 1$ is only
semistable), but regions where the salinity gradient is close to one decay slowly. We speculate
that averaging the solution over many independent forcing events will reflect this asymmetry and
produce a positive average salinity gradient $\langle S_y \rangle$; i.e., the presence of a
climatological gradient $\theta_y = 1$ breaks the symmetry in the decay of the salinity gradients
and determines a finite mean density ratio $R = 1/\langle S_y \rangle$.
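
The asymmetry of the diffusivity can be checked with a few numbers. The short sketch below evaluates $(S_y - \theta_y)^2$ for gradients of equal magnitude and opposite sign. The function name and the sample magnitudes are illustrative choices, not part of the model code.

```python
def nonlinear_diffusivity(s_y, theta_y=1.0):
    """Diffusivity (S_y - theta_y)^2 seen by salinity when T_y = theta_y."""
    return (s_y - theta_y) ** 2

# Compare gradients of equal magnitude and opposite sign (illustrative values).
for magnitude in (0.25, 0.5, 1.0):
    negative = nonlinear_diffusivity(-magnitude)
    positive = nonlinear_diffusivity(magnitude)
    print(f"|S_y| = {magnitude}: negative slope {negative}, positive slope {positive}")
```

For every magnitude the negative slope sees the larger diffusivity, and the diffusivity vanishes when $S_y = \theta_y$, which is why gradients close to one decay slowly.
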

Figure 5: Evolution of a Gaussian salinity anomaly subject to the nonlinear diffusion equation
in (24). The upper panels show the initial salinity and salinity gradient and the lower panels
the same variables at a later time. Only the central part of the integration domain is shown.


Some general properties of the evolution of the salinity gradient are better revealed by
writing equation (23) in variational form,

$$(S_y)_t = \partial_y^2 \left[ \frac{\delta \Phi}{\delta S_y} \right] + F_y, \qquad (25)$$

where $\Phi(S_y)$ is the functional

$$\Phi(S_y) = \int \left[ H(S_y, y) + \frac{1}{2} \mu (S_{yy})^2 \right] dy \qquad (26)$$

and $H(S_y, y)$ is the function

$$H(S_y, y) = \frac{1}{4} S_y^4 - \frac{2}{3} \theta_y S_y^3 + \frac{1}{2} \theta_y^2 S_y^2. \qquad (27)$$

In (25) we introduced the variational derivative of the functional $\Phi(S_y)$,

$$\frac{\delta \Phi}{\delta S_y} = H_{S_y} - \mu S_{yyy}. \qquad (28)$$

In the absence of forcing, the form (25) can be used to show that the system has only one
stable equilibrium, $S_y = 0$, which minimizes the functional $\Phi(S_y)$ (figure 6). Furthermore,
the shape of the potential is such that the decay towards equilibrium is slower around the
saddle line $S_y = \theta_y$. We then expect that, even though the random forcing creates positive
and negative $S_y$ anomalies with the same probability, salinity gradients with values close to
zero and $\theta_y$ are more persistent and the long time averaged salinity gradient at each point
settles to some value between zero and $\theta_y$.
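
The stability statements can be read directly from the derivatives of $H$ in (27). The short calculation below is a sketch for spatially uniform gradients; it neglects the hyperviscous term and assumes $\theta_y > 0$.

```latex
\begin{align*}
H_{S_y} &= S_y^3 - 2\theta_y S_y^2 + \theta_y^2 S_y = S_y (S_y - \theta_y)^2, \\
H_{S_y S_y} &= 3 S_y^2 - 4\theta_y S_y + \theta_y^2 = (S_y - \theta_y)(3 S_y - \theta_y), \\
H_{S_y S_y}\big|_{S_y = 0} &= \theta_y^2 > 0, \qquad
H_{S_y S_y}\big|_{S_y = \theta_y} = 0, \qquad
H_{S_y S_y S_y}\big|_{S_y = \theta_y} = 2\theta_y \neq 0 .
\end{align*}
```

The stationary points are $S_y = 0$, a minimum of $H$, and $S_y = \theta_y$, an inflection point where $H_{S_y}$ vanishes quadratically. The quadratic zero is what makes the decay slow near $S_y = \theta_y$ and the solution only semistable. Note also that $H_{S_y}$ equals the nonlinear flux $B_y^2 S_y$ when $T_y = \theta_y$, which connects (25) to (23).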

4.3  Numerical  simulations 

We begin our numerical experiments with the system in (22) and (23) by relaxing temperature
towards a linear climatological gradient; i.e., we set $\theta_y = 1$ in (22). The initial temperature
and salinity gradients are chosen to be zero. The model is run continuously for 3200 years.

During the first stages of the simulation the thermohaline fields evolve from the initial
conditions towards a statistically steady state. Temperature relaxes to the climatological
mean in a few years. After that, fluctuations from the equilibrium solution $T(y, t) = \theta(y)$
are negligible. The spin up time for salinity is approximately 200 years. In the following, all
statistics are computed discarding this initial spin up time.

Figure 7 shows the mean thermohaline fields averaged over 3000 years. In the upper left
panel, T and S profiles are plotted as a function of latitude. Salinity develops a large scale
structure, even though the random forcing is uniformly distributed in the domain. The long
time average profile of salinity is, to a good approximation, $\bar{S} = (2/3) y$. The analog of the
oceanographic mean horizontal density ratio for this model is $R = \bar{T}_y / \bar{S}_y = 3/2$, where the
overbars denote the temporal averaging. A mean density ratio of 1.5 on a scale of 1000 km,
the size of our domain, is consistent with the observational results of Stommel and Chen [4],
[5]. Notice though that our averages are carried over 3000 years, while both Stommel and
Chen used climatologies averaged over less than 20 years. We return to this point at the end
of the section.

Figure 6: The function $H(S_y)$ in the case of a linear climatological temperature gradient, i.e.,
$\theta_y = 1$. The dark line is a possible $S_y$ profile.

Simple arguments based on a Reynolds decomposition of the thermohaline fields are now
used to interpret the density ratio of 3/2 obtained in the numerical simulations. Let us
decompose S and T as the sum of a temporal mean plus fluctuations about that mean; i.e.,
$S = \bar{S} + S'$ and $T = \bar{T} + T'$. Numerical experiments show that we can safely neglect $T'$
in the Reynolds decomposition of T and write $T = \bar{T} = \theta$. Taking the time average of the
nonlinear diffusion equation for salinity then gives

$$\bar{S}_t = \left( \overline{S_y'^3} + \overline{S_y'^2} \, (3 \bar{S}_y - 2 \theta_y) + \bar{S}_y (\bar{S}_y - \theta_y)^2 \right)_y + \bar{F}. \qquad (29)$$


Both $\bar{F}$ and $\bar{S}_t$ vanish if the average is carried over a long enough time. The sum of the
first three terms in the RHS of (29) must then vanish as well. Notice that this sum cannot
be a constant different from zero, because of the no-flux boundary conditions. Furthermore,
the third term, $\bar{S}_y (\bar{S}_y - \theta_y)^2$, is likely to be smaller than the first two, because the typical
amplitude of the fluctuations created by the forcing is larger than the mean gradients; numerical
results show that this is indeed the case (last panel of figure 7). Therefore the leading
order balance involves only the first two terms. Setting
$\overline{S_y'^3} + \overline{S_y'^2} \, (3 \bar{S}_y - 2 \theta_y)$ to zero gives an
estimate of the mean salinity gradient,

$$\bar{S}_y = \frac{2}{3} \theta_y - \frac{\overline{S_y'^3}}{3 \, \overline{S_y'^2}}. \qquad (30)$$

The numerical simulations show that $\overline{S_y'^2}$ settles to a constant profile different from zero, while
$\overline{S_y'^3}$ keeps decreasing throughout the domain. This explains why $\bar{S}_y$ asymptotes to 2/3 when
averaged over long times. Finally we should remark that we did not include the hyperviscosity
term in this analysis, because its contribution to the long term balance is kept small
by choosing an appropriate value for $\mu$.
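
The decomposition of the averaged flux and the estimate of the mean gradient can be checked numerically. The sketch below uses synthetic Gaussian samples as a stand-in for the fluctuations $S_y'$; the sample size, the fluctuation amplitude and the trial mean gradient are assumptions made for illustration and are not taken from the simulations.

```python
import numpy as np

def reynolds_flux_terms(mean_grad, fluct, theta_y=1.0):
    """Three terms of the time-averaged salinity flux for flux S_y (S_y - theta_y)^2."""
    fluct = fluct - fluct.mean()               # fluctuations have zero mean by definition
    second = np.mean(fluct ** 2)
    third = np.mean(fluct ** 3)
    return (third,
            second * (3.0 * mean_grad - 2.0 * theta_y),
            mean_grad * (mean_grad - theta_y) ** 2)

def mean_gradient_estimate(fluct, theta_y=1.0):
    """Leading order estimate of the mean gradient from the first two terms."""
    fluct = fluct - fluct.mean()
    return 2.0 * theta_y / 3.0 - np.mean(fluct ** 3) / (3.0 * np.mean(fluct ** 2))

rng = np.random.default_rng(0)
fluct = rng.normal(0.0, 5.0, size=200_000)     # synthetic S_y' samples (assumption)
fluct = fluct - fluct.mean()
mean_grad = 0.4                                # trial mean gradient (assumption)

# Average of the full nonlinear flux, computed without any decomposition.
direct = np.mean((mean_grad + fluct) * (mean_grad + fluct - 1.0) ** 2)
terms = reynolds_flux_terms(mean_grad, fluct)
print("direct average:", direct, " sum of the three terms:", sum(terms))
print("estimated mean gradient:", mean_gradient_estimate(fluct))
```

The sum of the three terms reproduces the direct average, and for fluctuations with a vanishing third moment the estimate reduces to $2\theta_y/3$.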

The Reynolds decomposition analysis of the equations in (22) and (23) shows that a large
scale mean salinity gradient is required if all nonlinear fluxes are to balance when averaged
over long times. This balance depends on the particular form of nonlinear diffusion chosen.
That is, nonlinear closures different from $f(|\nabla B|) = 1$ in (8) and (9) would still produce a large
scale salinity gradient, but its dependence on $\theta_y$ would not be that given in (30). Stommel
came to the same conclusion for his two-box model [4]. He found that, for a diffusivity
between the two boxes of the form $|B_y|^n$, the salinity gradient becomes linearly proportional
to the climatological temperature gradient and the mean density ratio approaches $(n+1)/n$.
We verified that Stommel's argument also applies in the continuous limit for the diffusivity
$B_y^2$.
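
The $(n+1)/n$ scaling can be motivated by repeating the leading order balance for a diffusivity $|B_y|^n$. The following is a heuristic sketch and not Stommel's derivation. It assumes $T_y = \theta_y$, fluctuations $S_y'$ much larger than the mean gradients, and a negligible odd moment $\overline{|S_y'|^n S_y'}$.

```latex
\begin{align*}
|\theta_y - S_y|^n S_y
  &= |S_y' + \bar{S}_y - \theta_y|^n \, (\bar{S}_y + S_y') \\
  &\approx |S_y'|^n S_y' + |S_y'|^n \left[ (n+1)\,\bar{S}_y - n\,\theta_y \right], \\
\overline{|\theta_y - S_y|^n S_y} = 0
  \;&\Rightarrow\;
  \bar{S}_y \approx \frac{n}{n+1}\,\theta_y ,
  \qquad R = \frac{\theta_y}{\bar{S}_y} \approx \frac{n+1}{n} .
\end{align*}
```

For $n = 2$ this gives $\bar{S}_y \approx 2\theta_y/3$ and $R \approx 3/2$, in agreement with the leading term of (30).
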

So far we showed thermohaline profiles averaged over 3000 years. Averaging over shorter
times gives results that are not in statistical equilibrium. Results averaged over two different
subintervals of 30 years from the same simulation shown in figure 7 give a density ratio close
to 3/2 in one case (figure 8) and 1 in the second case (figure 9). The differences arise because
over 30 years the leading order balance in (29) involves both $\bar{S}_t$ and $\bar{F}$. It is necessary
to average over longer times to include a number of forcing events large enough to obtain
stationary statistics. Over a 3000 year time span, each point on the grid is hit on average
by $10^6$ forcing events, while over a 30 year time span this number is one hundred times
smaller and, as a consequence, the average of the forcing is 10 times larger: it is the number
of independent forcing events that sets the time required to have stable statistics. In this
sense our results are consistent with observations, because in their analysis Stommel and
Chen compute averages over a wide range of longitudes in space so that they include many
realizations of forcing events per unit time.
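
The numbers quoted above follow from simple counting. The sketch below assumes one independent forcing event per decorrelation time (one day) at each grid point, 365 days per year, and the usual result that the average of N independent zero-mean events scales as $1/\sqrt{N}$. The function names are illustrative.

```python
import math

DAYS_PER_YEAR = 365  # assumption: leap days ignored

def forcing_events(years, decorrelation_days=1.0):
    """Number of independent forcing events seen by one grid point."""
    return years * DAYS_PER_YEAR / decorrelation_days

def relative_mean_forcing(years_short, years_long):
    """Ratio of the averaged forcing over the short and the long window (1/sqrt(N) law)."""
    return math.sqrt(forcing_events(years_long) / forcing_events(years_short))

print("events in 3000 years:", forcing_events(3000))
print("events in 30 years:  ", forcing_events(30))
print("ratio of averaged forcing, 30 vs 3000 years:", relative_mean_forcing(30, 3000))
```

A window one hundred times shorter contains one hundred times fewer events, so the residual average of the forcing is larger by a factor of ten.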

5  Discussion 

The  main  result  of  this  project  is  that  nonlinear  diffusive  parameterizations  of  the  ML,  which 
use  buoyancy  as  the  driving  field,  can  explain  the  different  values  of  density  ratio  observed  in 
oceanographic  measurements.  At  scales  shorter  than  10  km  the  nonlinear  fluxes  are  so  strong 
that  a  horizontal  density  ratio  of  1  is  restored  in  a  few  days  whenever  a  forcing  event  creates 
an  anomaly  in  T  and  S.  At  larger  scales  the  correlations  between  temperature  and  salinity 
emerge  as  a  balance  between  nonlinear  diffusion  and  thermohaline  forcing.  The  large  scale 
forcing  on  temperature  is  modeled  as  a  relaxation  of  temperature  towards  a  climatological 
mean  profile  to  account  for  the  atmospheric  feedbacks  on  thermal  anomalies.  There  is  no 
reason  to  include  an  analogous  term  in  the  equation  for  salinity  and  the  forcing  on  salinity  is 
purely  stochastic.  The  difference  in  the  forcings  breaks  the  symmetry  between  T  and  S.  The 
large  scale  temperature  gradient  relaxes  in  a  few  years  towards  the  climatological  mean,  while
salinity  develops  a  large  scale  gradient  that  on  average  is  proportional  to  that  of  temperature 
even  though  the  forcing  on  salinity  is  uniformly  distributed  over  the  domain.  The  ratio  of 
the  large  scale  mean  gradients  of  T  and  S  is  close  to  3/2  for  the  specific  nonlinear  diffusion 
parameterization  used  in  the  simulations.  This  value  is  consistent  with  observations. 

[Figure 7 panel titles: "T and S averaged over 3000 years"; "T-S diagram averaged over 3000 years"; "Gradients averaged over 3000 years"; "Reynolds decomposition of $\partial_y(|\partial_y S - 1|^2 \, \partial_y S)$".]

Figure 7: Results of a numerical simulation of the system in (22) and (23) with $\theta = y$ and 1000
grid points. The upper panels show the $T$, $S$, $T_y$ and $S_y$ profiles as a function of latitude
$y$, averaged over 3000 years. The average salinity compensates 2/3 of the climatological
temperature gradient. The lower left panel is the T-S diagram of the averaged T and S
profiles. The lower right panel is a plot of the first three terms in the RHS of (29); $\overline{S_y'^3}$ is the
continuous line, $\overline{S_y'^2} \, (3 \bar{S}_y - 2 \theta_y)$ is the dashed line and $\bar{S}_y (\bar{S}_y - \theta_y)^2$ is the dash-dotted line.

[Panel titles: "T and S averaged over 30 years"; "T-S diagram averaged over 30 years"; "Gradients averaged over 30 years"; horizontal axis: y.]

Figure 9: Results from the same simulation shown in figure 7 averaged over a subinterval of
30 years.

In order to show both a density ratio of 1 at small scales and a density ratio of 1.5 at large
scales, we are running numerical simulations of the equations in (18) and (19) with stochastic
noise on temperature and salinity; that is, $G(y,t)$ and $F(y,t)$ have a white spectrum in space
and time. Preliminary results show that compensation develops over a few grid points and
produces a small scale density ratio of 1, while at large scales salinity compensates only part
of the temperature gradient and the density ratio settles to 1.5.

In our model the instantaneous distribution of buoyancy gradients at large scale is
determined by the most recent forcing events. On the other hand, observations in the ML of
all oceans show that buoyancy fronts tend to concentrate in regions of Ekman convergence.
The analysis of nonlinear diffusion models with a large scale advection that has regions of
convergence is a direction for future research. The hypothesis is that the stochastic forcing
creates buoyancy anomalies uniformly throughout the domain and the advective field collects
them towards regions of convergence. The expected result is that the average density ratio is
still 3/2, but that it is all due to a few localized gradients.

Finally, it would be instructive to implement nonlinear diffusive parameterizations of the
thermohaline eddy fluxes in large scale ocean models. It is well known that ocean models tend
to produce unrealistic distributions of salinity in the ML, because there is no feedback
mechanism that maintains large scale salinity anomalies within reasonable bounds. Typically the
cure is to introduce some ad hoc relaxation to observations. Nonlinear diffusive
parameterizations obtain the same result by introducing an indirect feedback through the climatological
temperature gradient. Furthermore, the parameterizations we have discussed in this project
are the results of closures based on sound physical arguments and are not dictated by
numerical necessity. An obvious goal is to test the different nonlinear parameterizations against
the observed density ratio to infer which are more appropriate.